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Preface to the Second Edition 


This book has now been in print for ten years and has seen several 
printings. During this period it has been very gratifying to me to have heard 
from a surprisingly varied group of readers that it has been helpful to them in 
learning the fundamentals of differentiable manifolds and their geometry. It 
appears that to some extent at least the objectives set out in the original 
preface have been realized. Satisfying though this may be, ten years is more 
than enough time to accumulate a long list of annoying misprints and errors 
and a collection of possible improvements, many of which were suggested by 
users and readers. This makes it a real pleasure to be given a chance to 
rethink a number of proofs, add new problems, and clean up as many errors 
and misprints as could be found. There are, in fact, changes on almost every 
second or third page, some slight and some substantial: a number of proofs 
have been modified or improved; the historical notes have been enlarged; and 
the references have been updated. It is hoped that this has improved the 
readability while preserving whatever good qualities were there before. 

I am very appreciative of the detailed comments and errata furnished by 
Chung-Shing Chen, H. V. Fagundes, J. F. C. Velson, and A. H. Clifford. I am 
profoundly grateful to the many students, colleagues, and friends encoun- 
tered in many places, who have said kind and encouraging things, and to 
those who have taken the trouble to write to me about this book. This is a 
reward for my efforts which is all the more gratifying for having been 
unanticipated. It is hoped that readers of the first edition will find the second 
edition tidied up and improved and that new readers will find it helpful to 
them in learning an exciting and important field of mathematics. 

I also wish to express my gratitude to the National Science Foundation 
for support at various times during which I have worked on this book. 


Lavenen bei Gstaad William M. Boothby 
Switzerland 
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Preface to the First Edition 


Apart from its own intrinsic interest, a knowledge of differentiable 
manifolds has become useful—even mandatory—in an ever-increasing 
number of areas of mathematics and of its applications. This is not too 
surprising, since differentiable manifolds are the underlying, if unacknow- 
ledged, objects of study in much of advanced calculus and analysis. Indeed, 
such topics as line and surface integrals, divergence and curl of vector 
fields, and Stokes’s and Green’s theorems find their most natural setting in 
manifold theory. But however natural the leap from calculus on domains of 
Euclidean space to calculus on manifolds may be to those who have made 
it, it is not at all easy for most students. It usually involves many weeks 
of concentrated work with very general concepts (whose importance is not 
clear until later) during which the relation to the already familiar ideas in 
calculus and linear algebra becomes lost—not irretrievably, but for all too 
long. Simple but nontrivial examples that illustrate the necessity for the 
high level of abstraction are not easy to present at this stage, and a 
realization of the power and utility of the methods must often be postponed 
for a dismayingly long time. 

This book was planned and written as a text for a two-semester course 
designed, it is hoped, to overcome, or at least to minimize, some of these 
difficulties. It has, in fact, been used successfully several times in preliminary 
form as class notes for a two-semester course intended to lead the student 
from a reasonable mastery of advanced (multivariable) calculus and a 
rudimentary knowledge of general topology and linear algebra to a solid 
fundamental knowledge of differentiable manifolds, including some facility 
in working with the basic tools of manifold theory: tensors, differential 
forms, Lie and covariant derivatives, multiple integrals, and so on. Although 
in overall content this book necessarily overlaps the several available 
excellent books on manifold theory, there are differences in presentation and 
emphasis which, it is hoped, will make it particularly suitable as an introduc- 
tory text. 
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To begin with, it is more elementary and less encyclopedic than 
most books on this subject. Special care has been taken to review, and 
then to develop, the connections with advanced calculus. In particular, all 
of Chapter II is devoted to functions and mappings on open subsets of 
Euclidean space, including a careful exposition and proof of the inverse 
function theorem. Efforts are made throughout to introduce new ideas 
gradually and with as much attention to intuition as possible. This has led 
to a longer but more readable presentation of inherently difficult material. 
When manifolds are first defined, an effort is made to have as many non- 
trivial examples as possible; for this reason, Lie groups, especially matrix 
groups, and certain quotient manifolds are introduced early and used 
throughout as examples. A fairly large number of problems (almost 400) 
is included to develop intuition and computational skills. 

Further, it may be said that there has been a conscious effort to avoid 
or at least to economize generality insofar as that is possible. Concepts are 
often introduced in a rather ad hoc way with only the generality needed 
and, if possible, only when they are actually needed for some specific purpose. 
This is particularly noticeable in the treatment of tensors—which is far from 
general—and in the brief introduction to vector bundles (more specifically 
to the tangent bundle). Thus it is not claimed that this is a compre- 
hensive book ; the student will emerge with gaps in his knowledge of various 
subjects treated (for example, Lie groups or Riemannian geometry). On the 
other hand it is hoped that he will acquire strong motivation, computa- 
tional skills, and a feeling for the subject that will make it easy for him to 
proceed to more advanced work in any of a number of areas using manifold 
theory: differential topology, Lie groups, symmetric and homogeneous 
spaces, harmonic analysis, dynamical systems, Morse theory, Riemann 
surfaces, and so on. 

Finally, it should be said that the author has tried to include at every 
stage results that illustrate the power of these ideas. Chapter VI is especially 
noteworthy in this respect in that it includes complete proofs of Brouwer’s 
fixed point theorem and of the nonexistence of nowhere-vanishing continuous 
vector fields on even-dimensional spheres. In a similar vein, the existence 
of a bi-invariant measure on compact Lie groups is demonstrated and 
applied to prove the complete reducibility of their linear representations. 
Then, in a later chapter, compact groups are used as simple examples of 
symmetric spaces, and their corresponding geometry is used to prove that 
every element lies on a one-parameter subgroup. In the last two chapters, 
which deal with Riemannian geometry of abstract n-dimensional manifolds, 
the relation to the more easily visualized geometry of curves and surfaces 
in Euclidean space is carefully spelled out and is used to develop the general 
ideas for which such applications as the Hopf-Rinow theorem are given. 
Thus, by a selection of accessible but important applications, some truly 
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nontrivial, unexpected (to the student) results are obtained from the abstract 
machinery so patiently constructed. 

Briefly, the organization of the book is as follows. Chapter I is a very 
intuitive introduction and fixes some of the conventions and notation that 
are used. Chapter II is largely advanced calculus and may very well be 
omitted or skimmed by better prepared readers. In Chapter III, the basic 
concept of differentiable manifold is introduced along with mappings of 
manifolds and their properties; a fairly extensive discussion of examples is 
included. Chapter IV is particularly concerned with vectors and vector 
fields and with a careful exposition of the existence theorem for solutions 
of systems of ordinary differential equations and the related one-parameter 
group action. In Chapter V covariant tensors and differential forms are 
treated in some detail and then used to develop a theory of integration 
on manifolds in Chapter VI. Numerous applications are given. It would 
be possible to use Chapters II-VI as the basis of a one-semester course 
for students who wish to learn the fundamentals of differentiable manifolds 
without any Riemannian geometry. On the other hand, for students who 
already have some experience with manifolds, Chapters VII and VIII could 
serve as a brief introduction to Riemannian geometry. In these last two 
chapters, beginning from curves and surfaces in Euclidean space, the concept 
of Riemannian connection and covariant differentiation is carefully 
developed and used to give a fairly extensive discussion of geodesics— 
including the Hopf-Rinow theorem—and a shorter treatment of curvature. 
The natural (bi-invariant) geometry on compact Lie groups and Riemannian 
manifolds of constant curvature are both discussed in some detail as examples 
of the general theory. This discussion is based on a fairly complete treatment 
of covering spaces, discontinuous group action, and the fundamental 
group given earlier in the book. 

This book, as do many of the books in this subject, owes much to the 
influence of S. S. Chern. For many years his University of Chicago notes— 
—still an important reference (Chern [1])—were virtually the only systematic 
account of most of the topics in this text. Even more importantly his 
courses, lectures, published works, and above all his personal encourage- 
ment have had an impressive influence on a whole generation of differential 
geometers, among whom this author had the good fortune to be included. 
Another source of inspiration to the author was the work of John Milnor. 
The manner in which he has made exciting fundamental research in 
differential topology and geometry available to specialist and nonspecialist 
alike through many careful expository works (written in a style that this 
author particularly admires) certainly deserves gratitude. No better material 
for further or supplemental reading to this text could be suggested than 
Milnor’s two books [1] and [2]. 

For part of the time during which this book was being written, the 
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author benefitted from a visiting professorship at the University of 
Strasbourg, France, and he is particularly grateful for the opportunity to 
work there, in an atmosphere so conducive to advancing in the task he 
had undertaken. 

The author would also like to acknowledge with gratitude the help given 
to him by his son, Thomas Boothby, by students and colleagues at 
Washington University, especially Humberto Alagia and Eduardo Cattani, 
and by Mrs. Virginia Hundley for her careful work preparing the manuscript. 
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I INTRODUCTION TO MANIFOLDS 


In this chapter, we establish some preliminary notations and give an intuitive, geometric 
discussion of a number of examples of manifolds—the primary objects of study throughout the 
book. Most of these examples are surfaces in Euclidean space; for these—together with curves 
on the plane and in space—were the original objects of study in classical differential geometry 
and are the source of much of the current theory. 

The first two sections deal primarily with notational matters and the relation between 
Euclidean space, its model R”, and real vector spaces. In Section 3 a precise definition of 
topological manifolds is given, and in the remaining sections this concept is illustrated. 


1 Preliminary Comments on R” 


Let R denote the real numbers and R” their n-fold Cartesian product 


É x- xR, 


the set of all ordered n-tuples (x',..., x") of real numbers. Individual n- 
tuples may be denoted at times by a single letter. Thus x = (Senge | 
a = (a',...,a"), and so on. We agree once and for all to use on R" its 
topology as a metric space with the metric defined by 


ats, 9) = (6 vy) - 


1 
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The neighborhoods are then open balls B?(x), or B,(x) or, equivalently, open 
cubes C?(x), or C,(x) defined for any € > 0 as 


B,(x) = {ye R" | d(x, y) < e}, 
and 
C(x) = {ye R"| |x- y| <e i= 1,..., n}, 


a cube of side 2e and center x. Note that R' = R and we define R° to be a 
single point. 

We shall invariably consider R” with the topology defined by the metric. 
This space R” is used in several senses, however, and we must usually decide 
from the context which one is intended. Sometimes R” means merely R” as 
topological space, sometimes R" denotes an n-dimensional vector space, and 
sometimes it is identified with Euclidean space. We will comment on this last 
identification in Section 2 and examine here the other meanings of R”. 

We assume that the definition and basic theorems of vector spaces over 
R are known to the reader. Among these is the theorem which states that any 
two vector spaces over R which have the same dimension n are isomorphic. 
It is important to note that this isomorphism depends on choices of bases in 
the two spaces; there is in general no natural or canonical isomorphism 
independent of these choices. However, there does exist one important 
example of an n-dimensional vector space over R which has a distinguished 
or canonical basis—a basis which is somehow given by the nature of the 
space itself. We refer to the vector space of n-tuples of real numbers with 
componentwise addition and scalar multiplication. This is, as a set at least, 
just R”; should we wish on occasion to avoid confusion, then we will denote 
it by V” (and use boldface for its elements (x instead of x, and so forth). For 
this space the n-tuples e, = (1, 0, ..., 0), ..., e, = (0, 0, ..., 0, 1) are a basis, 
known as the natural or canonical basis. We may at times suppose that the 
n-tuples are written as rows, that is, | x n matrices, and at other times as 
columns, that is, n x 1 matrices. This only becomes important should we 
wish to use matrix notation to simplify things a bit, for example, to describe 
linear mappings, equations, and so on. 

Thus R” may denote a vector space of dimension n over R. We sometimes 
mean even more by R”. An abstract n-dimensional vector space over R is 
called Euclidean if it has defined on it a positive definite inner product. In 
general there is no natural way to choose such an inner product, but in the 
case of R” or V”, again we have the natural inner product 


(x,y) = È xyi. 


i=1 


It is characterized by the fact that relative to this inner product the natural 
basis is orthonormal, (e;, e;) = 4,;. 
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Thus at times R” is a Euclidean vector space, but one which has a built-in 
orthonormal basis and inner product. An abstract vector space, even if 
Euclidean, does not have any such preferred basis. The metric in R” dis- 
cussed at the beginning can be defined using the inner product on R". We 
define ||x||, the norm of the vector x, by ||x|| = ((x, x))'’?. Then we have 


d(x, y) = ||x — y|. 
This notation is frequently useful even when we are dealing with R” as a 
metric space and not using its vector space structure. Note, in particular, 
that || x || = d(x, 0), the distance from the point x to the origin. In this equal- 
ity x is a vector on the left-hand side, and x is the corresponding point on 
the right-hand side; an illustration of the way various interpretations of R” 
can be mixed together. 


Exercises 


1. Show that if A isanm x n matrix, then the mapping from V” to V” (with 
elements written as n x 1 and m x 1 matrices), which is defined by 
y = Ax, is continuous. Identify the images of the canonical basis of V” as 
linear combinations of the canonical basis of V”. 

2. Find conditions for the mapping of Exercise 1 to be onto; to be 
one-to-one. 

3. Show that if W is an n-dimensional Euclidean vector space, then there 
exists an isometry, that is, an isomorphism preserving the inner product, 
of W onto R” interpreted as Euclidean vector space. 

4. Show that C”, the space of n-tuples of complex numbers, may be placed 
in one-to-one correspondence with R2". Can this correspondence be a 
vector space isomorphism? 

5. Exhibit an isomorphism between the vector space of m x n matrices 
over R and the vector space R™”. Show that the map X > AX, where A 
is a fixed m x m matrix and X is an arbitrary m x n matrix (over R), is 
continuous in the topology derived from R™. 

6. Show that ||x|| has the following properties: 


(a) [Ix + yl] < lx + ly]; 
(e) Ixl] — lly < Ix — yl; 
(c) Jax = a| [x], ve R; and 

(d) explain how (a) is related to the triangle inequality of d(x, y). 


7. Show that an isometry of a Euclidean vector space onto itself has an 
orthogonal matrix relative to any orthonormal basis. 

8. Prove that every Euclidean vector space V has an orthonormal basis. 
Construct your proof in such a way that if W is a given subspace of V, 
dim W = r, then the first r vectors of the basis of V are a basis of W. 
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2 R" and Euclidean Space 


Another role which R” plays is that of a model for n-dimensional 
Euclidean space E”, in the sense of Euclidean geometry. In fact many texts 
simply refer to R” with the metric d(x, y) as Euclidean space. This 
identification is misleading in the same sense that it would be misleading to 
identify all n-dimensional vector spaces with R”; moreover unless clearly 
understood, it is an identification that can hamper clarification of the con- 
cept of manifold and the role of coordinates. Certainly Euclid and the 
geometers before the seventeenth century did not think of the Euclidean 
plane or three-dimensional space—which we denote by E? and E*—as pairs 
or triples of real numbers. In fact they were defined axiomatically beginning 
with undefined objects such as points and lines together with a list of their 
properties—the axioms—from which the theorems of geometry were then 
deduced. This is the path which we all follow in learning the basic ideas of 
Euclidean plane and solid geometry, about which most of us know quite a 
bit before studying analytic or coordinate geometry at all. The identification 
of R” and E" came about after the invention of analytic geometry by Fermat 
and Descartes and was eagerly seized upon since it is very tricky and 
difficult to give a suitable definition of Euclidean space, of any dimension, in 
the spirit of Euclid, that is, by giving axioms for (abstract) Euclidean space 
as one does for abstract vector spaces. This difficulty was certainly 
recognized for a very long time, and has interested many great mathemati- 
cians. It led in part to the discovery of non-Euclidean geometries and thus to 
manifolds. A careful axiomatic definition of Euclidean space is given by 
Hilbert [1]. Since our use of Euclidean geometry is mainly to aid our 
intuition, we shall be content with assuming that the reader “knows” this 
geometry from high school. 

Consider the Euclidean plane Æ? as studied in high school geometry; 
definitions are made, theorems proved, and so on, without coordinates. One 
later introduces coordinates using the notions of length and perpendicular- 
ity in choosing two mutually perpendicular “number axes” which are used 
to define a one-to-one mapping of E? onto R? by p > (x(p), y(p)), the coor- 
dinates of p e E’. This mapping is (by design) an isometry, preserving dis- 
tances of points of E? and their images in R?. Finally one obtains further 
correspondences of essential geometric elements, for example, lines of £? 
with subsets of R? consisting of the solutions of linear equations. Thus we 
carry each geometric object to a corresponding one in R?. It is the existence 
of such coordinate mappings which make the identification of E’ and R? 
possible. But caution! An arbitrary choice of coordinates is involved, there is 
no natural, geometrically determined way to identify the two spaces. Thus, at 
best, we can say that R? may be identified with E? plus a coordinate system. 
Even then we need to define in R? the notions of line, angle of lines, and 
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other attributes of the Euclidean plane before thinking of it as Euclidean 
space. Thus, with qualifications, we may identify E? and R? or E" and R”, 
especially remembering that they carry a choice of rectangular coordinates. 

We conclude with a brief indication of why we might not always wish to 
make the identification, that is, to use the analytic geometry approach to the 
study of a geometry. Whenever E" and R" are identified it involves the choice 
of a coordinate system, as we have seen. It then becomes difficult at times to 
distinguish underlying geometric properties from those which depend on the 
choice of coordinates. An example: Having identified E? and R? and lines 
with the graphs of linear equations, for instance, 


L = {(x, y)|y = mx + b}, 


we define the slope m and the y-intercept b. Neither has geometric meaning; 
they depend on the choice of coordinates. However, given two such lines of 
slope m,, mz, the expression (m, — m,)/(1 + m, m2) does have geometric 
meaning. This can be demonstrated by directly checking independence of 
the choice of coordinates—a tedious process—or determining that its value 
is the tangent of the angle between the lines, a concept which is independent 
of coordinates! It should be clear that it can be difficult to do geometry, even 
in the simplest case of Euclidean geometry, working with coordinates alone, 
that is, with the model R”. We need to develop both the coordinate method 
and the coordinate-free method of approach. Thus we shall often seek ways 
of looking at manifolds and their geometry which do not involve coordin- 
ates, but will use coordinates as a useful computational device (and more) 
when necessary. 

However, being aware now of what is involved, we shall usually refer to 
R" as Euclidean space and make the identification. This is especially true 
when we are interested only in questions involving topology—as in the next 
section—or differentiability. 


Exercises 


1. Using standard equations for change of Cartesian coordinates, verify 
that (m, — m,)/(1 + m, m2) is independent of the choice of coordinates. 

2. Similarly, show that ((x3 — x1)? + (y2 — y1)?)"”? is a function of points 
P,(x;,¥;) and P3(x2, y2) which does not depend on the choice of 
coordinates. = 

3. How do we describe the subset of R” which corresponds to a segment pq 
in E"? to a line? to a 2-plane not through the origin? 


If we wish to prove the theorems of Euclidean geometry by analytical geometry methods, 
we need to define the notion of congruence. We say that two figures are congruent if there is a 
rigid motion of the space, that is, an isometry or distance-preserving transformation, which 
carries one figure to the other. 
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4. Identifying E? with R?, describe analytically the rigid motions of R°. 
Show that they form a group. 

5. Using Exercise 4 prove that two triangles are congruent if and only if 
corresponding sides are of equal length. 


3 Topological Manifolds 


Of all the spaces which one studies in topology the Euclidean spaces and 
their subspaces are the most important. As we have just seen, the metric 
spaces R” serve as a topological model for Euclidean space Æ”, for finite- 
dimensional vector spaces over R or C, and for other basic mathematical 
systems which we shall encounter later. It is natural enough that we are led 
to study those spaces which are locally like R", more precisely those spaces 
for which each point p has a neighborhood U which is homeomorphic to an 
open subset U’ of R”, n fixed. We say that a space with this property is locally 
Euclidean of dimension n, and in order to stay as close as possible to 
Euclidean spaces, we will consider spaces called manifolds, defined as 
follows. 


(3.1) Definition A manifold M of dimension n, or n-manifold, is a topologi- 
cal space with the following properties: 


(i) M is Hausdorff, 
(ii) M is locally Euclidean of dimension n, and 
(iii) M has a countable basis of open sets. 


As a matter of notation dim M is used for the dimension of M; when 
dim M = 0, then M is a countable space with the discrete topology. It 
follows from the homeomorphism of U and U' that locally Euclidean is 
equivalent to the requirement that each point p have a neighborhood U 
homeomorphic to an n-ball in R”. Thus a manifold of dimension 1 is locally 
homeomorphic to an open interval, a manifold of dimension 2 is locally 
homeomorphic to an open disk, and so on. Our first examples will remove 
any lingering suspicion that an n-manifold is necessarily globally equivalent, 
that is, homeomorphic, to £". 


(3.2) Example Let M be an open subset of R” with the subspace topo- 
logy; then M is an n-manifold. 


Indeed properties (i) and (iii) of Definition 3.1 are hereditary, holding for 
any subspace of a space which possesses them; and we see that (ii) holds with 
U = U’ = M and with the homeomorphism of U to U’ being the identity 
map. A bit of imagination, assisted perhaps by Fig. I.1, will show that even 
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A a ae 


{a) (b) 


Figure I.1 


(a) The manifold is the open set M of R? between the curves C and C'. (b) The manifold is 
the open subset of R? obtained by removing the knots. 


when n = 2 or 3 these examples can be rather complicated and certainly not 
equivalent to Euclidean space in general, although they may be in special 
cases: a trivial such case is M = E". 


(3.3) Example The simplest examples of manifolds not homeomorphic to 
open subsets of Euclidean space are the circle S! and the 2-sphere S?, which 
may be defined to be all points of E?, or of E°, respectively, which are at unit 
distance from a fixed point 0. 

These are to be taken with the subspace topology so that (i) and (iii) are 
immediate. To see that they are locally Euclidean we introduce coordinate 
axes with 0 as origin in the corresponding ambient Euclidean space. Thus in 
the case of S* we identify R? and E>, and S? becomes the unit sphere 
centered at the origin. At each point p of S? we have a tangent plane and a 
unit normal vector N,. There will be a coordinate axis which is not perpen- 
dicular to N, and some neighborhood U of p on S? will then project in a 
continuous and one-to-one fashion onto an open set U’ of the coordinate 
plane perpendicular to that axis. In Fig. I.2a, N, is not perpendicular to the 
X-axis so for qe U, the projection is given quite explicitly by @(q) = 
(x*(q), 0, x°(q)), where (x"(q), x?(q), x°(q)) are the coordinates of q in E°. 
Similar considerations show that S’ is locally Euclidean. Note that S$? and 
R? cannot be homeomorphic since one is compact while the other is not. 


(3.4) Example Our final example is that of the surface of revolution ob- 
tained by revolving a circle around an axis which does not intersect it. The 
figure we obtain is the torus or “inner tube” (denoted T?) as shown in 
Fig. I.2b. This figure can be studied analytically; it is easy to write down an 
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(a) (b) 
Figure 1.2 
(a) The spherical surface S? as a manifold. (b) The torus as a manifold. 


equation whose locus it is if we introduce coordinates in E? as shown in the 
figure. In order to convince ourselves that it is indeed locally Euclidean we 
consider once more the normal vector N, at p€ T?. There will be at least one 
coordinate axis to which it is not perpendicular, say x°. Then some neigh- 
borhood U of p projects homeomorphically onto a neighborhood U’ in the 
x!x?-plane as illustrated. Since we use the relative topology derived from E°, 
the space T? is necessarily Hausdorff and has a countable basis of open sets. 
Thus conditions (i)-(iii) of Definition 3.1 are satisfied. 


(3.5) Remark It should be clear from the last two examples that certain 
subspaces M of E? are easily seen to be 2-manifolds; they are surfaces which 
are “smooth,” that is, without corners or edges, so that they have at each 
peM a (unit) normal vector N, and tangent plane T,(M)—to introduce 
notation we use later—which varies continuously as we move from point to 
point. (By this last requirement we mean that the components of the unit 
normal vector depend continuously on the point p.) This smoothness allows 
us to prove the locally Euclidean property by projection of a neighborhood 
of p onto a plane as in Examples 3.3 and 3.4. The other properties are 
immediate since we use the subspace topology. Figure I.3 shows some fur- 
ther examples of manifolds which can be obtained in this way. Obviously 
this method will not always work: The surface of a cube is a 2-manifold, in 
fact it is homeomorphic to S?; but it has no tangent plane or normal vector 
at the corners and edges. 


Example 3.2 gives an inkling at least, of how nasty a space can be and 


still be a manifold, even when it is connected—which we do not suppose in 
general. The following theorem will offer some reassurance. 
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Figure 1.3 


(3.6) Theorem A topological manifold M is locally connected, locally 
compact, and a union of a countable collection of compact subsets ; furthermore, 
it is normal and metrizable. 


Proof These are all immediate consequences of the definition and stan- 
dard theorems of general topology. Let p be a point of M and U a neighbor- 
hood of p homeomorphic to an open ball B,(x) of radius £ in R”. We denote 
this homeomorphism by g, and we suppose (p) = x. Then it is clear that 
interior to any neighborhood V of p there is a neighborhood W whose 
closure W is in U and for which ọ(W) = B;(x) with e > 6 > 0. It follows 
that M is locally connected at p since B;(x) and hence W, to which it is 
homeomorphic by g~', is connected. Similarly W is compact since B;(x) is 
compact; thus M is locally compact. Because M has a countable base of 
open sets, we may now suppose that it has a countable base of relatively 
compact open sets {V}; obviously M = |) V;. Normality follows from 
Lindelöf’s theorem and metrizability is then a consequence of the Urysohn 
metrization theorem (see Kelley [1]). 


There is one difficulty in our concept of manifold about which we can do 
nothing at present. In fact it concerns Euclidean spaces and their topology. 
and arises even before consideration of manifolds: it is the question of 
dimension. Could it be that Æ" and E” are homeomorphic, or locally 
homeomorphic—so that an open set U of E” is homeomorphic to some open 
set U’ of E” with m + n? The answer is no, but the proof is difficult and 
requires algebraic topology. It was proved in 1911 by L. E. J. Brouwer and is 
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known as Brouwer’s theorem on invariance of domain. For a proof see 
Hurewicz and Wallman [1]. Later we shall be able to give a differentiable 
version of this theorem which will be sufficient for our needs; in this chapter 
we assume the theorem. 

We make one final remark which connects this section with the preced- 
ing one. The notion of coordinates plays an important role in manifold 
theory, just as it does in the study of the geometry of £". In E", however, it is 
possible to find a single system of coordinates for the entire space, that is, to 
establish a correspondence between all of E” and R”. Built into the definition 
of n-manifold M is a correspondence of a neighborhood U of each pe M and 
an open subset U’ of R”. Letting @ : U > U’ be this correspondence, we call 
the pair U, @ a coordinate neighborhood and the numbers x(q), ..., x"(q), 
given by o(q) = (x'(q), ---, x"(q)), the coordinates of qe M. We have 
assumed that this g is a homeomorphism: it is one-to-one and g and o`! 
are continuous. Thus each q € U has n uniquely determined coordinates, real 
numbers, which vary continuously with q. Of course the function q > x(q), 
which gives the ith coordinate, 1 < i < n, is continuous; it is called the ith 
coordinate function. There is obviously nothing unique about our choice of 
coordinates; in Examples 3.3 and 3.4, we could equally well project the 
neighborhood of p discussed there to other coordinate planes. Finally note 
that even in the case of Euclidean space it is often useful to use local coordin- 
ates; the domain of a polar coordinate system on E”, for example, must omit 
a ray if it is to be one-to-one. 


Exercises 


1. Consider the following subset of R? : X = A, U A u B with 
A, = {(x, y)[x20,y= +1}, 
A- = {(x, y)[x20,y = —1} 
B = {(x, y)|x <0, y = 0}. 


Define a topology as follows: We use the subspace topology (open 
intervals as a basis) on A, — {(0, 1)}, 4- — {(0, —1)} and B; then for 
e > Owe let NZ = {(x, +1)|0 < x < e} u {(x, 0)| —e < x < O} and use 
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N; and N; as a basis of neighborhoods of (0, 1) and (0, — 1), respec- 
tively. Show that the space X is locally Euclidean but is not a manifold. 


A Hausdorff space M is said to be paracompact if every covering {U,} of M by open sets has 
a locally finite refinement; more precisely. there is a covering |V,} which (i) refines {U,} in the 
sense that each V, c U, for some g, and which (ii) is locally finite, that is, each pe M has a 
neighborhood W which intersects only a finite number of sets V,. 


2. Show that a manifold is paracompact. Show that a locally Euclidean, 
paracompact, Hausdorff space need not have a countable basis. 

3. Show that a connected manifold M is pathwise connected, that is, 
p,qéM implies that there exists a continuous curve f(s), 0 < s <1, 
with f(0) = p, f (1) = q. 

4. Show that the (connected) components of a manifold M are open sets 
and are countable in number. 


4 Further Examples of Manifolds. Cutting and Pasting 


A hemispherical cap (including the equator) or a right circular cylinder 
(including the circles at the ends) are typical examples of manifolds with 
boundary. Except for the equator, or the end-circles, they are 2-manifolds 
and these boundary sets are themselves manifolds of dimension one less. In 
fact, they are homeomorphic to S+ or to S' U S? in these two cases. An even 
simpler example is the upper half-plane H?, or more generally H”, where we 
shall mean by H” the subspace of R” defined by 


H" = {(x',..., x"JeR"| x" > 0}. 


Every point pe H” has a neighborhood U which is homeomorphic to an 
open subset U’ of R” except the set of points (x',..., x" 1, 0), which ob- 
viously forms a subspace homeomorphic to R"~', called the boundary of H” 
and denoted by ðH”. 

We shall define a manifold with boundary to be a Hausdorff space M with 
a countable basis of open sets which has the property that each pe M is 
contained in an open set U with a homeomorphism ọ to either (a) an open 
set U’ of H” — dH" or (b) to an open set U’ of H” with g(p) e ôH”, that is, a 
boundary point of H”. It can be shown (as a consequence of invariance of 
domain) that pe M is in one class or the other but not both; those p of the 
first type are called interior points of M and those p mapped onto the 
boundary of H” by one, and hence by all, homeomorphisms of their neigh- 
borhoods into H” are called boundary points. The collection of boundary 
points is then denoted by ðM and is called the boundary of M. It is a 
manifold of dimension n — 1. We make no attempt to prove these facts here, 
but they will be discussed briefly in Chapter VI. 
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Figure 1.4 


Some examples of pasting. 


Our interest is in pointing out that new surfaces, that is, 2-manifolds, can 
be formed by fastening together manifolds with boundary along their boun- 
daries, that is, by identifying points of various boundary components by a 
homeomorphism, assuming of course the necessary condition that such 
components are homeomorphic. The simplest examples are S?, which is 
obtained by pasting two disks (or hemispheres) together so as to form the 
equator, and T?, formed by pasting the two end-circles of a cylinder 
together. However, one can go much further and paste any number of cylin- 
ders onto a sphere S? with “holes,” that is, with circular disks removed. This 
gives various pretzellike surfaces as illustrated in Fig. 1.4. We leave as an 
exercise the proof that these are manifolds. Thus to generate new 2- 
manifolds from old ones we may (1) cut out two disks, leaving a manifold M 
whose boundary @M is the disjoint union of two circles, and (2) paste on a 
cylinder or “handle” so that each end-circle is identified with one of the 
boundary circles of M. 

The pasting on of handles is not the only way in which we can generate 
examples of 2-manifolds. It is also possible to do so by identifying or pasting 
together the edges of certain polygons. For example, the sides of a square 
may be identified in various ways in order to obtain surfaces. Figure 1.5 
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illustrates this: we obtain a cylinder, Möbius band, torus, and Klein bottle. 
The latter cannot be pictured as a surface in E> unless we allow it to cut itself 
as shown. Thus as a subspace of E? it is not a manifold: it is possible to 
identify the sides of the square, as shown, and obtain a manifold—but it is 
not possible to put it inside E°. 

For connected 2-manifolds M which lie smoothly inside £? so that there 
is a tangent plane and normal line L, at each point p, we may ask whether it 
is possible to choose a unit normal vector N, (on L,) for every pe M which 
varies continuously with M. It is easy to see that this is possible for S? and 
T? but not for the Möbius band (which is actually a manifold with boun- 
dary) or the Klein bottle. We say that a manifold or manifold with boundary 
is orientable if such a choice of N, is possible. The following is a fundamental 
theorem of 2-manifolds. 


Cylinder a Twisted (mobius) 


band 
(a) (b) 
Klein bottle 


(c) (d) 
Figure I.5 


Four ways to identify sides of a rectangle: (a) cylinder; (b) twisted (Möbius) band; (c) torus; (d) 
Klein bottle. 
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(4.1) Theorem Every compact, connected, orientable 2-manifold is homeo- 
morphic to a sphere with handles added. Two such manifolds with the same 
number of handles are homeomorphic and conversely, so that the number of 
handles (called the genus) is the only topological invariant. 


This is a very satisfying theorem in that it shows that 2-manifolds of a 
certain large class can be enumerated and completely described to within 
homeomorphism (for a proof see Massey [1]). This can actually be carried 
further. Nonorientable as well as noncompact 2-manifolds can be described 
equally completely—although the noncompact case is more involved as 
might be expected. One can show also that every connected, one- 
dimensional manifold is homeomorphic to S' or to R depending on whether 
it is compact or not. However, beginning with n = 3 everything is far more 
complicated. In fact A. A. Markov has shown that for n = 4 no algorithm 
classifying compact orientable manifolds is possible (see Massey [1, p. 144). 

Curves and surfaces, that is, one- and two-dimensional manifolds in E°, 
formed the objects of study in classical differential geometry. We shall 
frequently refer to them as sources of examples and new ideas. 


Exercises 


1. Assuming invariance of domain, show that 0H” is a manifold of dimen- 
sion n — 1 and that no neighborhood in H” of a point of dH” can be 
homeomorphic to an open subset of R". 

2. Prove that adding a handle to a 2-manifold in the fashion described 
above for S? and T? actually does give a 2-manifold. 

3. Prove in detail that it is possible to obtain a 2-manifold by identifying 
sides of the square as shown in Fig. I.5d (Klein bottle). 

4. Prove that identification of points at opposite ends of diameters on the 
boundary of the circular disk D? defines a 2-manifold. 


According to a theorem of topology, if a compact orientable 2-manifold is obtained by 
pasting together triangles along their edges, then the number x = f — e + v (faces — edges + 
vertices) is the same for two surfaces M, and M, which are homeomorphic: z is independent of 
the way the surface is cut up into triangles. (x is called the Euler characteristic of the surface.) 


5. Let My = S? and M, be the surface obtained from Mo by adding g 
handles. Compute the relation between the genus g and the Euler 
characteristic x. 


5 Abstract Manifolds. Some Examples 


The manifolds of dimensions 1 and 2 considered above are pictured as 
subspaces of E?’ except in the case of the Klein bottle. This is the way in 
which manifolds are first and most easily visualized. However, the definition 
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makes no such requirement. Such visualization makes equivalent (homeo- 
morphic) manifolds look different just because they are differently placed in 
Euclidean space; and we might easily be led to think that they are different. 
Several examples of equivalent manifolds are shown in Fig. 1.6. In spite of 
appearances, they are homeomorphic. 


Figure 1.6 


Three equivalent manifolds. 


As we might expect from the definition, it is possible to give examples of 
manifolds which we do not think of as lying in Euclidean space. Indeed, it is 
not clear that they can be realized at all as a subspace of Euclidean space. 
This can already be guessed from the construction of manifolds by pasting, 
which does not really use E° at all. The simplest, as well as one of the most 
important examples of manifolds defined “abstractly ”—-not as a subspace of 
Euclidean space—is real projective space P"(R), the space of (real) projective 
geometry. It may be defined as follows. Let an equivalence relation ~ be 
defined on R"** — {0} by 


pack ea) 


if there is a real number ¢ such that y' = tx} i = 1,...,n + 1; briefly y = tx. 
Then we denote by [x] the equivalence class of x and by P"(R) the set of 
equivalence classes. There is a natural map z : R"*! — {0} > P"(R) given by 
n(x) = [x] and we shall topologize P"(R), as is usual in the case of such 
quotient spaces, by saying that U c P"(R) is open if and only if n7 t(U) is 
open in R"*'. This gives P"(R) the structure of an n-manifold (as shown in 
the exercises). We note that there is an alternative description of P"(R) as the 
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space of all lines through the origin 0 of R"*t!- x takes each x # 0 to the line 
through 0 which contains it. Then we define the topology as follows: a 
collection U of lines is open if it is the set of all lines through 0 which meet a 
given open set U. 

This example may be generalized as follows: Let M be the set of all 
r-planes through the origin in R”, where n and r are fixed; for example, the 
set of all planes through the origin in R? or the set of all three-dimensional 
planes through the origin of R$, and so on. This set has a natural topology 
which makes it a manifold. Intuitively it consists of defining a neighborhood 
of a given plane p to be all planes q which are “close” to it in a more or less 
obvious sense: there exist corresponding bases of both planes p and q (con- 
sidered as r-dimensional subspaces of R”, as a vector space) such that corre- 
sponding basis vectors are close, say, for example, that their differences have 
norm less than some € > 0. 

Further important and useful examples of manifolds force themselves 
upon our attention when we begin to study the geometry of some of the 
manifolds we already have discussed. For example, consider S?, the unit 
sphere in R?. We denote by T(S’) the collection of all tangent vectors to 
points of S?, including the zero vector at each point. Thus T(S?) = 
(pes T,(S?). This set has a natural topology: two tangent vectors X , and 
Y, are “close” if their initial points p and q and their terminal points are 
close. Similarly, if M is any of the 2-manifolds we have considered which lie 
“smoothly” in E>, so as to have a tangent plane at each point which turns 
continuously as we move about on M, then T(M) = Jpem T,(M) is a mani- 
fold, called the tangent bundle of M. The dimension of T(M) is 4 since, 
roughly speaking, X, depends locally on four parameters: two being the 


Figure 1.7 


The 2-sphere S? and some of its tangent vectors—elements of T(S?). 
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local coordinates of p relative to some coordinate neighborhood U and two 
more being the components which determine X, relative to some basis 
{E1,,.E2,} of T,(M), a basis which varies continuously over the neighbor- 
hood U. We later make these statements quite precise and in so doing 
exhibit the locally Euclidean character of T(M). For the moment we note 
that E, and E, can be visualized as vectors tangent to the coordinate curves 
x! = constant and x? = constant in U. This is illustrated in Fig. I.7. 

We should note that these manifolds are not subspaces of E’, even 
though M is and although the geometry of E? is used here to describe them. 
In fact, one of our major tasks is to describe T,(M) and T(M) independently 
of any way of placing M in Euclidean space, that is, to give a description 
valid for an abstract manifold. 

The manifolds mentioned above arose quite naturally from studies of the 
geometry of curves and surfaces in E°. In fact, Gauss used, in a very essential 


Figure 18 
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way, a mapping which he introduced for orientable surfaces in E>. Let M be 
such a surface and let N, be a unit normal vector at each pe M, so defined 
that N, varies continuously with p on M. Translate N, to N, from a fixed 
origin 0 and let G(p) be the endpoint of N, on S?, the unit sphere at 0. The 
mapping taking p to G(p) is known as the Cais mapping, and the Gaussian 
curvature is a measure of the distortion of areas under this mapping: If M is 
sharply curved near p, then the area of a small region around p would be 
greatly magnified in mapping to S?. Even if M is not orientable, we still 
have a tangent plane T,(M) at each point p and it is parallel to a uniquely 
determined plane G(p) through the point 0. Thus a slight variant of the 
previous definition defines a mapping (as shown in Fig. 18) of M to the 
manifold of 2-planes through 0 introduced above. Or again, using normal 
lines instead of tangent planes, we can obtain a mapping from M to the 
manifold of lines through 0, which as we have remarked, is equivalent to 
P?(R). 


Exercises 


pen 


Show that P?(R) and the manifold of Exercise 4.4 are homeomorphic. 

2. Show that P?(R) and the set of all planes through the origin of R° are in 
natural one-to-one correspondence. 

3. Show that the set of all pairs (x, y) of mutually orthogonal unit vectors x 
and y of V°, with its natural inner product, is a manifold. What is its 
dimension? Generalize if possible. 

4. Prove that the manifold of orthonormal pairs of vectors in V° 
(Exercise 3) is homeomorphic to To(S7), the tangent sphere bundle of S, 
which consists of all unit vectors tangent to S?. 

5. Let C be a one-dimensional manifold (curve) in R°. Show that the 
collection of all vectors normal to C form a three-dimensional manifold. 
What sort of manifold would the unit vectors normal to C give us? 

6. Manifolds may be obtained as the locus of one y more algebraic equa- 

tions, for example, S? = {(x, y, z) | x? + y?-+ z? = 1}. Show that the 

torus T? may be given as the sed of an equation in x, y, z 


Notes 


Curves and surfaces in Euclidean space were studied since the earliest days of geometry 
and, after they were invented, both analytic geometry and calculus were systematically used in 
these studies. However, the discoveries of Gauss, announced in 1827, profoundly altered the 
course of differential geometry and pointed the way to the concept of abstract differentiable 
manifolds—the underlying spaces of every geometry and of other important mathematical 
theories as well. In his celebrated “ Theorema Egregium ” Gauss showed that there is a measure 
of curvature of a surface (now called the Gaussian curvature) which depends only on one’s 
ability to measure the lengths of curves on the surface. This means that this curvature is 
unchanged by alterations of shape of the surface which leave arclength unchanged. (It is easily 
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seen that there are such alterations. For example, we may roll a plane surface into a cylinder or 
cone, or we may gently squeeze a hemisphere in along its edge, the equator.) This discovery of 
an “inner” geometry, independent of the shape of the surface in E°, led very naturally toward 
the invention of abstract surfaces (2-manifolds) on which a measure of arclength is (somehow) 
provided. The discovery by Bolyai and Lobachevskii (independently) about 1830 of non- 
Euclidean geometry was another important step toward the concept of abstract spaces—spaces 
not conceived of as subspaces of Euclidean space. (Non-Euclidean geometry satisfies all of 
Euclid’s postulates except the one which affirms that through any point p not on a line L there is 
exactly one line parallel to L. As in Euclidean geometry, lengths of curves and distances between 
points have meaning; see hyperbolic space, Section VIII.6.) The existence of such spaces was 
apparently already known to Gauss, who kept his knowledge to himself— presumably for fear of 
the profoundly disturbing philosophical and religious consequences such a discovery would 
bring about. It can only be compared with the discovery of relativity in this century, for which it 
laid the foundation. The belief that we live in a Euclidean world had been an article of faith for 
centuries and was not easily challenged. 

A second great impetus to these new ideas was given by Riemann in his inaugural address at 
Göttingen in 1854. He introduced the idea of a space or “manifold” having its existence outside 
of Euclidean space but locally like it; he made clear what arc length would mean in this case (see 
Section V.3)—separating metric and topological properties. Both Euclidean and non-Euclidean 
geometry and spaces of arbitrary dimension were included within his ideas. Later he made 
extensive use of the notion of abstract two-dimensional manifolds in analytic function theory by 
his systematic use of Riemann surfaces. 

In the remainder of the 19th century many further important developments in geometry 
took place. The most important by far from the point of view of modern differential geometry 
and manifold theory being the monumental work of Poincaré. He introduced concepts and 
raised questions which have dominated much of the work in these fields in this century. Among 
them the use of groups to measure topological invariants of manifolds, the importance of 
manifolds in the study of dynamical systems, the Poincaré conjecture—central to an understand- 
ing of three dimensional manifolds but still unresolved, and many others. A second great impetus 
coming at the end of the 19th century and early in this century was the work on Lie groups, 
especially in the hands of the great differential geometer E. Cartan. In his famous Erlangen 
program of 1872 Felix Klein had shown the intimate relation of all geometries with groups, and 
with the later development of Lie groups (themselves important differentiable manifolds) it 
became clear that both Euclidean and non-Euclidean geometry could be thought of as branches 
of group theory. All of these applications gradually clarified the concepts themselves, as did the 
emergence of topology, so that the ideas of manifold theory and differential geometry are now 
highly developed and used across the entire mathematical spectrum, in relativity theory, analysis, 
Lie groups, algebraic topology, algebraic geometry, and elsewhere. The reader will find historical 
sketches in many of the references. In particular, Gauss’s famous paper [1] is available in an 
annotated English translation and Riemann’s Inaugural Address is translated in the book of 
Spivak [2], which contains an excellent and very readable analysis of this paper and is highly 
recommended for those interested in a history of differential geometry. The reader will also find 
an elegant intuitive discussion of surfaces given by Hilbert and Cohn-Vossen [1]. 
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In this chapter we review in some detail the differential calculus which we will nced later. 
The purpose is to build a bridge between the reader’s previous knowledge of multivariable 
calculus and the somewhat specialized facts we need here, especially the inverse function 
theorem and the theorem on rank. (Many readers can skim over or skip this chapter entirely.) 

Briefly, the topics treated are the following: In Section 1 we define differentiability of 
real-valued functions of many variables and its immediate consequences, in particular the mean 
value theorem. In Section 2 this is extended to the case that concerns us most, a mapping F 
from an open subset U of R” into R". Here the Jacobian is defined and the mean value theorem 
restated for mappings. Sections 3 and 4 deal with the concept of the space of tangent vectors 
T,(R") at a point a e R”; this will be most important in studying manifolds, especially Section 4 
in which T,(R”) is defined in a way that admits generalization. Section 5 reviews the notion of 
vector field in R”. Section 6 gives a detailed proof of the inverse function theorem. This theorem 
with its corollaries, especially the theorem on rank (Section 7), is one of the basic theorems on 
which most of our theory is built. 


1 Differentiability for Functions of Several Variables 


In this section we review briefly some facts about partial derivatives from 
advanced calculus. Few proofs are given; they may be worked out as prob- 
lems or found in advanced calculus texts, for example, Apostol [1], 


20 
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Dieudonné[1], or Fleming [1]. We will consider real-valued functions of 
several variables, more precisely functions whose domain is a subset A c R” 
and whose range is R. If f: 4 +R is such a function, then f(x) = 
f(x', ..., x") denotes its value at x = (x',..., x")e€ A. We assume through- 
out this section that fis a function on an open set U c R". At eacha e U, the 
partial derivative (éf/éx‘), of f with respect to x’ is, of course, the following 
limit, if it exists: 


1 i ny o 1 J n 
(2) — mt 28 + h,..., a") fla pene yy a) 


If of /6x! is defined, that is, the limit above exists at each point of U for 
1 <j < n, this defines n functions on U. Should these functions be contin- 
uous on U for 1 <j <n, f is said to be continuously differentiable on U, 
denoted by fe C'(U). 

Mere existence of partial derivatives is too weak a property for most 
purposes. For example, the function defined on R? by 


x 
ie ear, and f(0,0)=0 


f(x,y) 


is not continuous at (0, 0), yet both derivatives are defined there. The natural 
generalization of existence of the derivative for functions of one variable is as 
follows. We shall say that f is differentiable at ae U if there is a (homogen- 
eous) linear expression )7_, b;(x' — a‘) such that the (inhomogeneous) 
linear function defined by f(a) + )"_ , b,x‘ — a‘) approximates f(x) near a 
in the following sense: 


f(s) -Sla - Zbl- a’) _ 4 


ual |x- al 


or equivalently, if there exist constants b,,..., b, and a function r(x, a) 
defined on a neighborhood V of ae U which satisfy the following two 
conditions: 


f(x) =f (a) + ¥ bx! — a) + ||x — allr(x, a) 
on V, and 


lim r(x, a) = 0. 
If fis differentiable for every a € U, we say it is differentiable on U. [Warning: 
this is a technical definition from advanced calculus. Beginning with Chap- 
ter III differentiable will be used rather loosely to mean differentiable of 
some order, usually infinitely differentiable (C”).] Note that differentiability 
on U is a local concept, that is, if f is differentiable on a neighborhood of 
each point of U, then fis differentiable on U. By the mean value theorem, for 
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a function of one variable the existence of the derivative at a € U is equiva- 
lent to differentiability; but for functions of several variables, as we have 
seen, this is not the case. The exercises at the end of this section and the 
following statements (1.1)-(1.3), whose proofs we leave as exercises, will 
clarify these concepts. 


(1.1) Iff is differentiable at a, then it is continuous at a and all the partial 
derivatives (6f/0x'), exist. Moreover the b; are uniquely determined for each a 
at which f is differentiable; in fact b; = (6f/0x'),- 


By virtue of (1.1) when f is differentiable at a we have 


k of i i 
s-sa = ¥ (SE) (a! — a) + Ix = ars a) 
We denote by (df),, or simply df, the homogeneous linear expression on the 
right: 


2 (F)\ i_ ai 
(1.2) df= > (2) e a’). 
It is called the differential of f at a. 


(1.3) If of/ex', ..., Effex" are defined in a neighborhood of a and continuous 
at a, then f is differentiable at a. 


Thus existence and continuity of the partial derivatives of f on an open 
set U c R” implies differentiability of f at every point of U. We define 
inductively the notion of an r-fold continuously differentiable function on an 
open set U = R” (function of class C’): f is of class C” on U if its first 
derivatives are of class C’~!. Equivalently we may say that f has continuous 
derivatives of order 1, 2, ..., r on U.Iffis of class C” for all r, then we say that 
fis smooth, or of class C”. As in the case of C!, we denote these classes of 
functions on U by C’(U) and C®(U). 

We now state the first version of the chain rule; a more general version 
will be given in the next section. Define a differentiable (C") curve in R” to be 
a mapping of an open interval (a, b) = {xe R |a < x < b} of the real num- 
bers into R”, f: (a, b) > R", with f (t) = {x"(t),..., x"(t)}, where the n coordin- 
ate functions x'(t),..., x"(t) are differentiable (resp. C") on the interval. 
(Recall: For functions of one variable “differentiable” and “derivative exists” 
are equivalent.) Now suppose that fis a differentiable curve and maps (a, b) 
into U, an open subset of R". Let a < tọ < b and suppose that g is a function 
on U which is differentiable at f(tọ) € U. Then the composite function g o fis a 
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real-valued function on (a, b). We assert that g « fis differentiable at tọ and 
that its derivative at tọ is given by the chain rule 


d " (0g dx' 

The proof is left as an exercise. Using it we may establish the mean value 
theorem for functions of several variables. We shall say that a domain U is 
starlike with respect to ae U provided that whenever xe U, then the seg- 
ment ax lies entirely in U (see Fig. II.1). This is a somewhat weaker property 
than convexity of U, a convex set being starlike with respect to every one of 
its points. 


Figure II.1 


(1.5) Theorem (Mean Value Theorem) Let g be a differentiable function 
on an open set U c R"; let ae U and suppose that U is starlike with respect to 
a. Then given xe U there exists 0e R, 0 < 0 < 1, such that 


ats) — ala) = X (58) (0 = a 


the derivatives Cg/€x',...,é@g/¢x" all being evaluated at the same point 
a + 6(x — a) on the segment ax. 


Proof Set f(t) = a+ t(x — a), that is x‘(t) = a + t(x' — a’). Then the 
corresponding curve is a line segment with f (0) = aand f (1) = x. This curve 
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is differentiable, in fact C”, so that g - f maps [0, 1] into U and is differen- 
tiable on (0, 1). Applying the standard mean value theorem for functions of 
one variable (as in elementary differential calculus) and using (1.4) to com- 
pute the derivatives gives the formula. 


(1.6) Corollary Let U and g be as in Theorem 1.5. If |ég/6x'| < K on U, 
i= 1,2,...,n, then for any xe U, we have 


| g(x) — g(a)| < K /n |x — all. 
Proof Taking absolute values in the formula of Theorem 1.5 and using 
the Schwarz inequality gives 
ôg \? 
< 
< [z (x) 


È (ae -a 
| g(x) — gla)| < Ky/n||x — all. I 


The following corollary is an important consequence and should be 
proved as an exercise. 


1/2 


| g(x) — g(a)| = 


Therefore 


(1.7) Corollary If f is of class C" on U, then at any point of U the value of 
the derivatives of order k, 1 < k < r is independent of the order of differentia- 
tion, that is, if (jı, .--, ją) is a permutation of (i,,..., i), then 
f o'f 
Axil --- Axik T xiL- 8xi 

Taylor’s theorem on polynomial approximation with its formula for the 
remainder Ry,,, or error of the approximation of degree N, as well as the 
corollary theorem on power series expansions are easily extended to func- 
tions of several variables using the technique of Theorem 1.5 (see Apostol [1] 
and Dieudonné [1]). As in the single variable case, a necessary but not a 
sufficient condition that a function be (real) analytic, that is, can be ex- 
panded in a power series at each ae U, an open set of R", is that it be in 
C*(U). [We write fe C°(U) iffis real analytic on U.] Although knowledge of 
analytic functions is not needed in this text, it is helpful—since C° implies 
C*—to know that any linear function f(x) = ) a;x', or any polynomial 
P(x',..., x") of n variables, is an analytic function on U = R"; the same is 
true for any quotient of polynomials (rational function) if we exclude from 
the domain the points at which the denominator is zero. Thus, for example, a 
determinant is an analytic function of its entries and, if we exclude n x n 
matrices of determinant zero (which have no inverses), then each entry in the 
inverse A~' of a matrix a is an analytic (and hence C%) function of the 
entries in the matrix A. 
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Exercises 


Prove (1.1). 

. Prove (1.3) using the mean value theorem for functions of one variable. 

3. Prove that all first partial derivatives of a differentiable function vanish 
at an extremum. 

4. Let U be an open subset of R” and let C°(U) and C'(U) denote the 

continuous and continuously differentiable functions on U. Let D(U) 

denote the functions which are differentiable on U. Show that C°(U) > 

D(U) > C!(U) and construct examples to show that in general the inclu- 

sions are proper. 

Show that the inclusions C!(U) > C?(U) > ++ > C®(U) are proper. 
6. Prove that C” > C®, and that the inclusion is proper. [Hint: Let 
f(0) = 0, f(t) = exp(—1/t?) for t #0; fis C” on R. Is it C° on R?] 

7. Prove (1.4), that is, prove that g o fis differentiable at t = to and that its 
derivative is given by (1.4). 
8. Prove Corollary 1.7. 


eS 


m 


Sometimes it is important to extend the definitions of differentiability, C', and so on, to 
functions defined on a subset A c R", which is not assumed to be an open set, for example, a 
function f(t) of one variable on the closed interval 0 < t < 1. We say that fis differentiable, of 
class C”, of class C”, and so on, on A if f can be extended to a differentiable, C", C® function, 
respectively, on an open subset U of R” which contains A. 


9. Show that if A = {xe R" |a < x’ < bi, a < bi i = 1, ..., n} and fis dif- 
ferentiable on A, then the value of éf/éx' at any point of A is independent 
of the extension chosen. Can you find any example to show that for 
some sets A this is not the case? If so, does assuming C! help? 


2 Differentiability of Mappings and Jacobians 


In this section we generalize the ideas of the previous section to the case 
of functions defined on subsets of R” but whose range is in R” rather than 
R. We will refer to them as mappings (or maps) and, insofar as possible, re- 
serve the term function for real-valued functions as in Section 1. If n‘: 
R”—R denotes the projection to the ith coordinate, namely, 


m(x',...,x4,...,x")=x', and if F:A—>R” is a mapping defined on 
A c R", then F is determined by its coordinate functions f = n’ > F; in fact 
for x € A, 


F(x) = (f 1x), nF (x) 


Conversely, any set of m functions f+, ..., f™ on A with values in R deter- 
mines a mapping F: A > R” with the coordinates of F(x) given by 
f(x), ..., fx) as above. 
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We are interested in the case where U is an open set of R”, possibly all of 
R". Since many authors identify R” and V™ (see Section I.1), these are 
sometimes referred to as vector-valued functions on R”, although we will not 
use that terminology. From general topology we know that F is continuous 
if and only if its coordinate functions are. We shall say that F is differentiable, 
of class C", C”, C®, and so on, at ae U or on U if each of its coordinate 
functions has the corresponding property. We may sometimes call a C® 
mapping F a smooth mapping; if F is smooth, then each coordinate function 
fi possesses continuous partial derivatives of all orders and each such deriva- 
tive is independent of the order of differentiation. 

If F is differentiable on U, we know that the m x n Jacobian matrix 


of? g ôf 
êxt ax" 
Of! sf") _ 
Ole yin X") : l 
opm f™ 
ax! ax" 


is defined at each point of U, its mn entries being functions on U. These 
functions need not be continuous on U ; they are so if and only if F is of class 
C!. Since differentiability is needed in Section II.6, it is useful to give another 
formulation of this concept for mappings. We leave the proof to the exer- 
cises. [Note: later differentiable will be used to mean C®; see Section III.3.] 


(2.1) A mapping F: U > R", U an open subset of R", is differentiable at 
ae U (or on U) if and only if there exists an m x n matrix A of constants 
(respectively, functions on U) and an m-tuple R(x, a) = (r' (x, a), -.., r(x, a)) 
of functions defined on U (on U x U) such that ||R(x, a)|| > 0 as x > a and for 
each xe U we have 


(x) F(x) = F(a) + A(x — a) + ||x — al]R(x, a). 
If such R(x, a) and A exist, then A is unique and is the Jacobian matrix. 


{In the expression (+), A(x — a) denotes a matrix product, so we must 
write (x — a) as ann x 1 (column) matrix and read this as an equation in 
m x 1 matrices. The last term means that each component of the m-tuple 
R(x, a) is multiplied by ||x — al||.] 

Corollary 1.6 extends immediately to mappings in the following form. 
The proof is left as an exercise. 


(2.2) Theorem Let aeU be an open subset of R" which is starlike with 
respect to a, and let F: U + R” be differentiable on U with | éf'/éx’| < K, 
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1<i<m,1<j <n, at every point of U. Then the following inequality holds 
forallxeU: 
(+x) |F(x) — F(a)|| < (am)? K]||x — al. 

We will use DF to denote the Jacobian matrix of a differentiable map- 


ping F and DF(x) to denote its value at x. If F is differentiable on U, then for 
ae U expression (*) becomes 


F(x) = F(a) + DF(a)(x — a) + ||x — al[R(x, a). 


We remark that F e C'(U) if and only if DF(x) varies continuously with 
x, that is, x > DF(x) is a continuous map of U into the space .@,,,(R) of m x n 
matrices, identified with R"” and given the corresponding topology. 

Just as in the case of functions we wish to prove a chain rule for composi- 
tion of mappings. We suppose U c R" and V c R™ are open sets and 
F: U > V c R” and G: V > R so that H = Go F is defined on U, which it 
maps into R”. We may write the coordinate functions of H using those of F 
and G: 


h'(x) = g o F(x) = g( f'(x), f"), f= lap 


(2.3) Theorem (Chain rule) Let F, G, H be as above and suppose that F is 
differentiable at ae U and G is differentiable at b = F(a). Then H = Go F is 
differentiable at x = a and we have 


DH(a) = DG(F(a)) : DF(a) 


(where ; indicates matrix multiplication). If F is differentiable on U and G on V, 
then this holds for every ae U. 


Proof According to the characterization above it is enough to show 
that the p-tuple Ry(x, a) defined by 


H(x) — H(a) — DG(F(a)): DF(a) : (x — a) = |x — a| Ry(x, a) 


approaches 0 as x approaches a. Using y = F(x), b = F(a), and the differen- 
tiability of F and G at a and b, we may write 


H(x) — H(a) = G(y) — G(b) = DG(b) - (y — b) + ||y — || Rey, b), 
and 
y — b = F(x) — F(a) = DF(a)- (x — a) + |x — al|R;(x, a). 
Then, replacing y by F(x) and b by F(a), 
H(x) — H(a) = DG(a)- DF(a)- (x — a) 


IF) — Fla 


+ [x = a | DG(F(a))  Re(x, a) + a] 


Rg (F(x), F(a))}. 
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Using the continuity of F, which is an immediate consequence of differen- 
tiability, and the properties of R(x, a) and R¢(y, b), we see that as x > a the 
expression in curly braces, which we may denote by R,,(x, a), goes to zero. 
This completes the proof. 


(2.4) Corollary If F and G are of class C (or smooth) on U and V, respec- 
tively, then H = Go F is of class CŒ (or smooth) on U. 


Proof We prove only the statement for C’, although we will use the 
general case, whose proof is a problem in mathematical induction (see 
Dieudonné [1], where it is also proved for analytic mappings). If F and G are 
C', then they are certainly differentiable, and DF and DG are continuous 
functions on U and V. Since F is C’, it is continuous and so DG(F(x)) is 
continuous on U. Finally the product of two matrices is a continuous, in fact 
C®, mapping of R™ x R™ since the entries in the m x p product matrix are 
polynomials in the entries of the factors. Thus the chain rule formula gives 
DH(x) as a composite of functions which are at least continuous so that it 
must be continuous. This is equivalent to its entries being continuous which 
means that the coordinate functions of H, and thus H itself, are of class C?. 


i 
Exercises 
1. Prove (2.1). 
2. Prove Theorem 2.2. 
3. Prove Corollary 2.4 for the case r = 2 and try to construct a procedure 


which would give the result for all r. 

4. Prove that the rank of the product of two matrices is less than or equal to 
the rank of either factor. Show that multiplying a matrix on the left or 
right by a nonsingular matrix does not change its rank. 


Just as in the case of functions, we can extend the notion of differentiability, C", C*, and so 
on, to mappings into R” whose domain of definition is an arbitrary subset A c R". We say that 
F: A > R" is differentiable, C", C”, or C® if and only if it has an extension to an open set U > A 
which is, respectively, differentiable, C", C”, or C°. As we have mentioned in Section 1, there 
exist examples to show that under these circumstances DF(x) may not be uniquely defined at 
each x€ A, that is, it may depend on the extension of F, the simplest example being that A is a 
single point. Thus one must use some care in dealing with this case. The following two 
problems involve this generalization. 


5. Let U be an open subset of R” and F: U > R",m < n, be a C' mapping. 
Suppose that F is injective (one-to-one into) and that F~': A > U, 
where A = F(U) is also of class C’. Then show that m cannot be less 
than n. (This is a weak version of the theorem of Brouwer: There exists 
no homeomorphism of an open set U of R” into R", m < n.) 
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6. Let A be a closed cube in R? and suppose F: A > R” is a mapping of 
class C!. Prove that the value of DF on A is independent of any exten- 
sion. Generalize this to other domains A and to class C®%. 


3 The Space of Tangent Vectors at a Point of R” 


Although we shall presently restrict our attention to R”, let us first con- 
sider E", or E? at least, for the sake of intuition. Our purpose is to attach to 
each point a of R” an n-dimensional vector space T,(R”). We know how to do 
this in Euclidean space: If ac E’, we let T,(E*) be the vector space whose 
elements are directed line segments X, with a as initial point. These are 
added by the parallelogram law: —X, is the oppositely directed segment 
and 0 is the segment consisting of the point a alone. We have supposed that a 
unit of length was chosen in E? and we may denote by || X,|| the length of the 
segment. Multiplication by positive (negative) real numbers leaves the direc- 
tion unchanged (reversed) and multiplies the length by the absolute value of 
the number. To show that this does indeed give a vector space of dimension 
3 over R is an exercise in solid geometry. Thus we attach to each point of E? 
a three-dimensional vector space called the tangent space at that point. 

We shall ultimately attach vector spaces at each point of more com- 
plicated spaces, namely manifolds; this was briefly indicated in Section I.5. 
There is, however, a unique feature of the tangent spaces of Euclidean space 
which is not shared by the tangent spaces at points of manifolds; the tangent 
spaces at any two points of Euclidean space are naturally isomorphic, that is, 
there is an isomorphism determined in some unique fashion by the geometry 
of the space—not chosen by us. (Without the restriction of naturality, the 
statement would be trivial since any two vector spaces of the same dimen- 
sion over the real numbers are isomorphic, but in general there is no unique 
isomorphism singled out, rather we must choose one arbitrarily from a very 
large collection.) 

Indeed, if a, b are points of E°, then there is exactly one translation of the 
space taking a to b; this translation moves each line segment issuing from a 
to a line segment from b and thus carries T,(E*) to 7,(E*). Since parallelo- 
grams go to congruent parallelograms and lengths are preserved, this corre- 
spondence is an isomorphism; and it is uniquely determined by the geometry 
(Fig.H.2). If we choose a fixed point a as origin and choose at a three linearly 
independent vectors E,,, E,,, E3,, for example, three mutually perpendicu- 
lar unit vectors, then this will automatically determine a basis not only of 
T,(E?) but also (by parallel translation) of T,(E*) for every be E°’. All of this 
is intuitive geometry and we have not really proved the statements we have 
made. Therefore we turn to R” where we are able to be more precise and 
rigorous, but we keep in mind our geometric model. 
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Figure II.2 


Let a = (a',..., a") be any point of R”. We define 7,(R"), the tangent 
(vector) space attached to a, as follows. First, as a set it consists of all pairs of 
points (a, x), or aX, a = (a',..., a") and x = (x’,..., x"), corresponding, of 
course, to initial and terminal points of a segment. We also denote such a 
pair by X,, using upper case letters for vectors. We next establish a one-to- 
one correspondence g,: T,(R") > V” between the set just described and the 
vector space of n-tuples of real numbers by the following simple device: If 
X, = 4X, then 9,(X,) = (x! — a’,..., x" — a"). Finally the vector space 
operations (addition and multiplication by scalars) are defined in the one 
way possible so that p, is an isomorphism. This requires that 


Xa + Y, = Pa (PalXa) + Pal ¥a))s 
aX, = Pa (29(Xa)), ZER, 


the right-hand side being used to define the operations on the left. Clearly we 
are being guided by the fact that R" and E" may be identified if we choose 
rectangular Cartesian coordinates in Æ". This is equivalent to choosing an 
origin 0 and n mutually orthogonal unit vectors there, (E;)o,-.., (Eno. lying 
on each (positive) coordinate axis—as do i, j, and k in the usual model for 
E?. Then vectors at any point a are uniquely determined by their compon- 
ents relative to the basis E,,,..., Ena, which in turn are given by subtracting 
from the coordinates of the terminal point of each vector, the coordinates of 
its initial point a. The geometry of E" has guided us to a proper method for 
defining the tangent space at each point of R”. Please note that V” has a 
canonical basis e! = (1, 0,..., 0), ..., e" = (0,..., 1) and this gives at each 
ae R" a natural or canonical basis E,, = 9, (e1), .-., Ena = Pa (€n) Of 
T,(R"). The canonical isomorphism given by translation in the case of £" is 
now o; ° a: T,(R") > T(R"), and we have X, = ax corresponding to 
Y, = by if and only if x' — a = y' — b', i = 1, 2,..., n. However, we never 
identify the tangent spaces into a single vector space as is often done in 
discussions of vectors on Euclidean space, that is, we never equate vectors 
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with different initial points; in particular. we cannot add a vector in T,(R") 
and one in 7,(R") where a Æ b. The reason for our insistence on this point 
will appear when we learn how to attach a tangent space T,(M) to each point 
p of a manifold in general, for then we have nothing corresponding to the 
natural isomorphisms of T,(E*) and 7,(E°) given by the translations of E’. 
For example, there is no natural isomorphism of the tangent vectors to S? at 
two distinct points p and q of S°. It follows that our method of defining 
T,(R") at each a—which depended on such an isomorphism—is not suitable 
for generalization in its present form. Therefore we shall give two further 
methods for defining 7,(R"), one in this section with details left as exercises 
and a second, which we use in the remainder of the text, in the section 
following this one. 


ž(0)= (0) 


x(t) = (x), x2 (4) 


Figure II.3 
Equivalent Curves: x(r) and y(t). 


We begin by a formal description of the first definition. Let x(t), 
—g < t < £, be a C! curve in R” passing through a e R” when t = 0, that is, 
assume x(t) = (x'(t),..., x"(t)), where x‘(t) is C! and x‘(0) = af i = 1,..., n. 
Let I, = A |t | < e}. Then each such curve is a C! map of J, > R", 
where ¢ > 0 and may vary from curve to curve. Two curves are equivalent, 
x(t) ~ y(t), if at t = 0, the derivatives with respect to t of their coordinate 
functions are equal: x'(0) = j(0), i= 1, ..., n. Let [x(t)] denote the equiv- 
alence class of x(t); to each [x(t)] corresponds an n-tuple of numbers 
x(0) = (x'(0), ..., "(0)), that is, an element of V". Using this map we obtain 
a vector space structure on the collection of equivalence classes which we 
denote, predictably, by T,(R"). Details are left as exercises. Intuitively speak- 
ing, if we use the identification of R” with E” plus a rectangular Cartesian 
coordinate system, we see that x'(0) is the ith component of the velocity 
vector of the particle whose motion is given by x(t) = (x'(t),..., x"(t)) at the 
instant it passes through a (see Fig. II.3). Two curves are equivalent if they 
represent two motions with the same velocity at this instant. 
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Exercises 


1. Show that the map [x(t)] > (x'(0), ..., x"(0)) is one-to-one and onto p”, 
so that it can be used to define the structure of a vector space on the 
collection T,(R") of equivalence classes. 

2. Prove that this definition of T,(R”) is equivalent to the earlier one of the 
present section. 

3. Using a standard method of definition of the tangent plane to S?, the 
unit sphere in R°, show that the vectors of T,(R*), a e S*, which belong 
to equivalence classes [x(t)] determined by curves lying on S?, determine 
a subspace of T,(R°) and that this subspace may be naturally identified 
with the tangent plane to S? at a. 

4. For each of the vectors E;,, a€ R” and i= 1,..., n, identify the equiv- 
alence class of curves corresponding to it by defining a particularly 
simple curve in the class. This gives an interpretation of the canonical 
basis of T,(R”). 


4 Another Definition of T,(R") 


In this section we give a characterization of the space of tangent vectors 
attached to a point a of R” which we shall later use in extending this concept 
to manifolds. In spite of its formal and abstract nature it is relatively easy to 
work with; it is hoped that some intuitive clarification has resulted from the 
earlier definitions. 

Let us denote by C®(a) the collection of all C® functions whose domain 
includes a, identifying those functions which agree on an open set containing 
a—since we are only interested in their derivatives at a. Let X, = )?.., a'Eig 
be the expression for a vector of 7,(R") in the canonical basis; we define the 
directional derivative Af of f at a in the “direction of X,” by 
Af = Y7., œ Of/ox', Of/éx' evaluated at a = (a',..., a"). This is a slight ex- 
tension of the usual definition in that we do not require X, to be a unit 
vector. Since Af depends on f, a, and X, we shall write it as X* f. Thus 


no ô 
Xžf= Zef) . 


We may take the directional derivative in the “direction of X,” of any 
C°” function defined in a neighborhood of a. Hence f > X* f defines a map- 
ping assigning to each fe C”(a) a real number 

Xž: C” (a) > R. 


It is reasonable to denote this mapping by X*¥ = ł 7-1 2,(@/0x'), where we 
must remember that the derivatives are to be evaluated at a. We remark that 
X#x' = a',i = 1,...,n, so that the vector X, is completely determined if its 
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value on every C* function at a is known—or even on the functions 
f(x) = x. 

We have agreed not to distinguish between C* functions f, g in C*(a) if 
they agree on some open set containing a. Two functions of C*(a) may be 
added or multiplied to give another element of C%(a), whose domain is the 
intersection of their domains. If xe R, then fis a C” function with the same 
domain as f, so fe C” (a) implies afe C*(a); the same result would be ob- 
tained by multiplying f by a C® function whose value is x on some open set 
about a. Thus C%(a) is an algebra over R containing R as a subalgebra. 
Remembering the fundamental properties of derivatives we see at once that 
if «x, B are real numbers and f,g are C* functions defined in open sets 
containing a, then we have 

(i) Xžlef + Bg) = «(Xz f) + P(Xžg) (linearity) 
and 

(ii) X#(fo) = (X2 fsla) + f(a Xžg) (Leibniz rule). 

Let 2(a) denote all mappings of C*(a) to R with these properties; we 
may call the elements of 2(a) “derivations” on C*(a) into R. We see that 
2(a) is a vector space over R for if D,, D2: C*(a) > R and «, Pe R, then we 
define (wD, + BD) f = «(D,f) + B(D2f), where the operations on the right 
are in R. This defines in (a) both addition and multiplication by real 
numbers «, 8. This is the standard procedure for defining a vector space 
structure on maps of a set into a field. One must check that the vector space 
axioms are indeed satisfied by these operations. In particular, it must be 
verified that if D e 2(a), then «D € 2 (a), and if D,, D, € P(a), then so also are 
D, + D,. This means checking the linearity of «D:C*(a)>R and 
D, + D,: C” (a) > R and checking that the Leibniz rule is satisfied. We do 
this for yD only. Suppose then y, «, Be R, De P(a), and f, ge C*(a). Then 


(Daf + Bg) = [Daf + Bg)] (by definition of yD) 
= y[x(Df) + B(Dg)] (by property (i)) 
= ya(Df) + yB(Dg) (by the distributive law of R) 
= a(yD) f+ BiyD)g (by our definition of yD). 


It follows that the map yD: C®(a) —> R is linear. That yD satisfies the Leibniz 
rule for differentiation of products is equally easy: 


(DNJ) = AD) (by definition of yD) 
= y[(Df g(a) + f (a)(Dg)] (by property (ii)) 
= y(Df )g(a) + f (a)y(Dg) (these being real numbers) 


= ((yD)f)o(a) + f(aX(yD)g) (by definition of yD). 
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As was remarked, a similar verification shows that D, + D, is a derivation 
into R; it is left as an exercise. 

The correspondence X, > X* associates to each element X, of T,(R") an 
element of 2 (a), namely the mapping X*: C*(a) > R defined by taking the 
directional derivative of fe C*(a) at a in the direction X,. This mapping 
from T,(R") > G(a) is one-to-one since X¥ = Y* means that X* f= Yž ffor 
every fe C*(a) which implies X, = Y,. Indeed we have noted the ith com- 
ponent of X, relative to the natural basis is just X*x' so that 
Xa = } $- (Xžx)Ea = Y. Finally, it is easy to see that this mapping is 
linear. If Z, = «aX, + BY, T,(R"), then for the directional derivatives we 
have for any fe C*(a), 


Zi f= (Xf) + BYES). 


If interpreted in terms of the operations in Y(a), this means exactly that the 
mapping T,(R") > 2(a) is linear. In summary then, X, > X* defines an 
isomorphism of the vector space T,(R”) into the vector space Y(a), which 
allows us to identify T,(R”) with a subspace of 2 (a). However, more can be 
said; in fact this isomorphism is onto, and we have the following theorem. 


(4.1) Theorem The vector space T,(R") is isomorphic to the vector space 
(a) of all derivations of C®(a) into R. This isomorphism is given by making 
each X, correspond to the directional derivative X* in the direction of X,. 


To prove the theorem it only remains to show that every derivation of 
C*(a) into R is a directional derivative, that is, that X, > Xf is a map onto 
Q(a). This will result from two lemmas. 


(4.2) Lemma Let D be an arbitrary element of Z(a). Then D is zero on any 
function fe C*(a) which is constant in a neighborhood of a. 


Proof Because the map D is linear, it is enough to show that if 1 denotes 
the constant function of value 1, then D1 = 0. However, D1 = D(1-1) = 
(D1)1 + 1(D1) = D1 + D1 = 2D1, so D1 = 0. We must remember in inter- 
preting these equalities that multiplying fe C” (a) by a real number a gives 

. exactly the same result as multiplying by the C” function whose value is 
constant and equal to « in some open set (possibly R”) containing a, at least 
as far as the algebra C*(a) is concerned: we have identified R with the 
subalgebra of such functions. I 


(4.3) Lemma Let f(x’, ..., x") be defined and C”? on some open set U. If 
ae U, then there is a spherical neighborhood B of a, B < U, and C®-functions 
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g', ..., g" defined on B such that: 


o so- (34) 


and 


(ii) S,- x") = f(a) + Lier (x! — a')g'(x). 
Proof Let B c U be a spherical neighborhood of a and note that for 
xe B, f(x) = f(a) + Jo (3/ðt)f (a + t(x — a)) dt. Hence, 


f(x) =f(a) + Èe — a’) j |2 aat 


Let 
: fê 
g(x) = | E dt, i=1,...,n; 
=o ex at+t(x—-a) 
these are C*-functions and satisfy the two conditions. | 


Proof of Theorem 4.1 Using these lemmas we may complete the proof 
of Theorem 4.1. Suppose D is any derivation on C%(a). We wish to show 
that, given D e Y(a), there is a vector X, € T,(R") such that for any fe C*(a), 
we have X* f = Df. If this be so, then X* = D and we see that every deriva- 
tion of C®(a) into R is a directional derivative; thus the map X, > Xž of 
T,(R") to Aa) is an isomorphism onto. 

Let hi(x!,..., x") = xi Then denote by «' the value of Dh’, that is, 
ai = Dh’. Consider X, = )7-; 4; Eia; as an operator on C*(a), it gives 


“= Selaa), 


On the other hand, by Lemma 4.3, f(x) = f(a) + V7, (x! — a')g'(x) on 
some B,(a) in the domain of f. Restricting to B,(a) and using the properties of 
D, we may write 


Df = D(F(a)) + X {(D(x' — a’)ja(a) + 0+ Dg’. 


By Lemma 4.2, D(f(a))=0 and D(x‘ — a’) = Dx’ = a; and by 
Lemma 4.3, g'(a) = (¢f/éx'),. Therefore Df = )7_, a'(éf/0x'), = X# f. Since 
fis an arbitrary element of C*(a), we have D = X%. This completes the 
proof. i 


Theorem 4.1 allows us to identify the vector space T,(R") with the space 
(a) of linear operators on functions of C*(a) into R which satisfy the 
product rule of Leibniz, that is, the “derivations into R.” 
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Note that under this identification the canonical basis vectors 
Eia, -.., Ena Of T,(R") are identified with ¢/éx', ..., 6/éx", the directional 
derivatives (evaluated at x = a) in the directions of the coordinate axes: 


Ea > EX f= 1 á 


We will make this identification from now on for vectors in T,(R") and 
for this reason we will drop the asterisk » which distinguishes the vector X, 
as a segment or point pair from the directional derivative: X* f will be 
written X,f. In R" we may use either E;, or @/éx‘ to denote the unit vector 
parallel to the ith coordinate axis. This characterization of T,(R”) requires 
C°” functions; although C’(a) is an algebra, it is known to have other deriva- 
tions than directional derivatives. Our situation is then that we shall rely on 
Euclidean space for our geometric intuition of the space of tangent vectors at 
a point, but in formal definitions and proofs we will use the ideas above: a 
vector at a point is a linear operator of a certain kind—satisfying the pro- 
duct rule for derivatives—on the C® functions at the point. 


Exercises 


1. Letae Rand f, g be two C” functions whose domains of definition both 
contain a. We shall say fis equivalent to g at a, f ~ g, if and only if they 
agree On some open set containing a. Show that this is an equivalence 
relation and that the collection of equivalence classes, which we call 
germs of C” functions at a, is an algebra over R. [It is precisely this 
algebra which is C®(a).] 

2. Do the statements of Exercise 1 remain true if we replace C” by C’, 
r > 0? Suppose that we merely require equality of fand g and all of their 
derivatives at a, does this give an equivalence relation and an algebra 
over R, or is equality on an open set needed? 

3. Prove that the collection of all maps of a set X into a field F has a 
natural vector space structure. Follow the definitions indicated for 2 (a). 

4. Show that if D, and D, are in (a), then D, + D, is also in Ga). 

Using the definition (Section 3) of equivalence of C' curves through 

a € R”, prove that the mapping of T,(R"), defined as “velocity vectors,” 

to D(a), taking the class [x(t)] to the operator D defined by 


n ô ; 
b= x) (0) , 


dl 


is independent of the choice of the curve x(t) in [x(t)] and determines an 
isomorphism of T,(R") onto Y(a). 
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5 Vector Fields on Open Subsets of R” 


A vector field on an open subset U c R" is a function which assigns to 
each point pe U a vector X,e¢7,(R"). A similar definition applies to 
Euclidean space E". There are many examples in physics for n = 2 and 
n = 3. The best known is the gravitational field: If an object of mass p is 
located at a point 0, then to each point pin U = F” — {0}, there is assigned a 
vector which denotes the force of attraction on a particle of unit mass placed 
at the point. This vector is represented by a line segment or arrow from p (as 
initial point) directed toward 0 and having length kyu/r?, r denoting the 
distance d(0, p) and k a fixed constant determined by the units chosen (see 


a Sg 
> NS 
S Xq >. 
` ~ 
> \ 
oN a \ 
Ne x. be p \ 
K eT" 
\ 


\ 
\ 


Figure 11.4 
First quadrant portion of gravitational field of point mass at origin. 


Fig. II.4). If we introduce Cartesian coordinates with 0 as origin and 4 = 1, 
then for the point p with coordinates (xt, x”, x°) the components of X, in the 
canonical basis are 


kx! —kx? —kx3 
with = Ox)? + (2)? + 07)”, 


r > r? 2 r? 
that is, 

oa , , -ki,@ ,@ 4@ 
Xp = sy (XE + X° Bap + X Esp) = — 3 se ee a ae Re IE 


We note that the components of X „are C* functions of the coordinates. 
We shall say that a vector field on R” is C” or smooth if its components 
relative to the canonical basis are C® functions on U. Unless otherwise 
stated, all vector fields considered will be assumed to have this property, 
although it is quite possible to define continuous, C', and so on, vector fields 
also. When dealing with vector fields, as with functions, the independent 
variable will be omitted from the notation. Thus we write X rather than X, 
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just as we customarily use f rather than f (p) for a function. Then X, is the 
value at p of X, that is, the vector of the field which is attached to p—it lies in 
T(R”). 

Further examples of vector fields are given for each i = 1,..., n by the 
fields E; = 6/¢x' which assign to every pe R” the naturally defined basis 
vector E; at that point. The vector fields E,,..., E, being independent, even 
orthogonal unit vectors, at each point p form a basis there of T,(R"); such a 
set of fields is called a field of frames. The vector fields X,, X, on 
U = R° — {0} defined by X, = x'E, + x?E, and X, = x°E, — x'E, also 
define a field of frames; geometrically X ,, is a vector along a ray from 0 to p 
and X3, is a vector perpendicular to it, that is, tangent to the circle through p 
with center at 0 (see Fig. II.5). It is often convenient, as we know from 
elementary mechanics to use other frames (even in Euclidean space) than E, 
and E,. 


Figure II.5 


If X is a C*-vector field on U and f a C” function on U, then Xfis the 
C” -function on U defined by (Xf)(p) = X „f. Indeed, if the components of X 
are the functions «'(p),..., «"(p) so that X = )7_, aE;, then 


(XN) = È o(a) - 


i=} 


We see from the right-hand side that Xf is a C® function of p on U since 
x`(p)e C”(U) and 6f/éx'e C” (U). Thus f} Xf maps C”(U) > C”(U). 


PDF pdfFactory Pro www.fineprint.cn 


5 VECTOR FIELDS ON OPEN SUBSETS OF R” 39 


We note also that C*(U) is an algebra over R with unit, where R is 
identified with the constant functions and, in particular, the constant 
function 1 with the unit. It is natural to ask whether X is a linear map of 
C*(U) to C*(U) and more generally whether it is a derivation, that is, 
satisfies the Leibniz product rule. In fact, this is so, for we may write 


[X(af + Ba)\(p) = X (af + Bg) = A(X, f) + B(X,9) 
= a(Xf)(p) + B(Xg)(p), 


and 
[X( fa) lp) = (X, f alp) + f(X 9) 
=[(XS )(p)lo(p) + f (P(X gp). 


Since the functions on the right and left agree for each pe U, they are equal 
as functions. Thus X: C*(U) > C*(U) is a derivation which maps C*(U) 
into itself, a slight variation from the previous case. (This, in fact, is the 
customary use of the term “ derivation” ofan algebra. If A is an algebra over 
R, then a derivation is a map D: A > A which is linear and satisfies the 
product rule of Leibniz. For example, @/¢x is a derivation on the algebra of 
all polynomials in two variables x and y.) 

We conclude this section by proving an important property of 
C*-functions which, with the corollary given here, is used very often in 
discussions of vector fields (see the exercises). It is a “separation theorem” 
and contrasts strongly the behavior of C* and C® functions on R". (There 
exist stronger versions of this theorem as we shall see later.) 


(5.1) Theorem Let Fc R” be a closed set and K c R" compact, 
FaK = Ø. Then there is a C” function o(x) whose domain is all of R" and 
whose range of values is the closed interval [0, 1] such that o(x) = 1 on K and 
o(x) = 0 on F. 


Proof We prove the theorem in two steps. 


(a) Let B,(a) be an open ball of center a and radius £. We show that 
there is a C” function g(x) on R” which is positive on B,(a), identically 1 on 
B,)., and 0 outside B,(a). The function h(t) defined by h(0) = 0 fort < 0 and 
h(t) = e~ '" for t > Qis C% since we can prove by direct computation that all 
of its derivatives exist and are zero at t = 0, and since it is analytic for other 
values of t. We let 


k? — IIxl® 
(e? — Ixl) + Axl? — 32°) 


Since the denominator is never zero [at either £? — ||x||? or |]x||? — je? or at 
both h is positive], this is a C” function. When ||x|| > £, the numerator is 


g(x) = h 
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zero, otherwise it is positive; and when 0 < ||x|| < $e, the value of g(x) is 
identically 1. Thus g(x) is C®, vanishes outside B,(0), and is positive on its 
interior; in fact g(x) = 1 for x e B,,.(0). Hence g(x) = g(x — a) has the desired 
properties. 

(b) For step two let B,(a;), i = 1,..., k, bea finite collection of n-balls in 
R" — F such that | )*-, B,2(a;) > K. It is possible to find such a collection 
because of the compactness of K and since K ^ F = Ø. For B,(a;) let g,(x) 
have the properties above and define o(x) by 


ole) = 1-1-9) 


On each xe K at least one g; has the value 1, o(x) = 1,sog = 1 on K. 
Outside | )#., B,(a;) each g; vanishes so o(x) = 0 and, since F lies outside 
this union, ø = 0 on F. This completes the proof. | 


(5.2) Corollary Let f(x', ..., x") be C” on an open set U c R" and let 
ae U. Then there is an open set V c U, which is a neighborhood of a, and a 
C? function f *(x!, ..., x") defined on all of R" such that f*(x) = f(x) for all 
xe V and f*(x) = 0 for x outside U. 


Proof Choose any neighborhoods V,, V, of a such that Yew; 
V, c U and F, is compact. Let V, = K and F = R" — V, in Theorem 5.1. 
Then take o(x), a C® function whose value is 1 on V, and 0 outside V2, a 
is, on F. Define f*(x) = o(x) f(x) for xe U and f*(x) = 0 for xe R" — 
Since f* thus defined is C” on U, where it is equal to of, and is ae A 
R" — V, where it is identically zero, and since on the (open) intersection 
U — V, of these two sets, both definitions agree, we see that f* is C” on R” 
and has the properties needed. i 


Exercises 


1. Prove that the function h(t) defined above does in fact have derivatives 
of all orders at t = 0 and that they all have the value zero there. Why 
does this imply that h(t) is not analytic at t = 0? 

2. Suppose a vector X „€ T,(R”) is given at each p € U, an open subset of R". 
Show that this defines a C® vector field if and only if for each fe C” (U), 
Xfis C” on U. 

3. Let D be a derivation on the algebra C*(U). If f, g are in C*(U) and 
f(x) = g(x) for all xe V, an open subset of U, then prove (Df)(x) 
= (Dg)(x) at each point x e V. 

4. Using the preceding problem show that the derivations Z(U) on C*(U) 
define a vector space over R and that this space is isomorphic with the 
space of C” vector fields on U, which we denote ¥(U). Show that these 
are infinite-dimensional vector spaces. 
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5. If D, and D, are derivations on an algebra A, show by example that 
D, D, need not be a derivation but that D, Da — D, D, isa derivation. If 
we consider A=C%(U), U openc R", X =) a'(x)d/dx' and 
Y => f'(x) ô/ðy e ¥(U), then this means that XY — YX = } y'(x) 
6/éx'. Verify this and compute y;(x). 

6. Give examples of linear operators on C*(U), U an open subset of R”; 
which are not derivations of C*(U). 

7. Let peU, an open subset of R”, and let X,€ 7,(R") be a vector at p. 
Show that X, may be extended to a C® vector field X on U. 

8. In Theorem 5.1, assume only that K is closed (not necessarily compact). 
Does the theorem still hold? 


6 The Inverse Function Theorem 


In order to simplify the terminology of this and later sections we intro- 
duce the notion of diffeomorphism, or differentiable homeomorphism, 
between two spaces. Of course this concept can have no meaning unless the 
spaces are such that differentiability is defined, which at the present moment 
means that they must be subsets of Euclidean spaces. Therefore without 
prejudice to a more general later treatment, let us suppose that U c R” and 
V c R" are open sets. We then shall say that a mapping F: U > V is a 
C’-diffeomorphism if: (i) F is a homeomorphism and (ii) both F and F~' are 
of class C”, r > 1 (when r = œ we simply say diffeomorphism). It is perhaps 
not obvious why we need to require both F and F~' to be of class C’—it is 
because we wish the relation to be symmetric. As the following example 
shows, the differentiability of F~' is not a consequence of that of F, even 
when F isa homeomorphism. Let U = Rand V = Rand F: ths = t°; this 
is a homeomorphism and F is analytic but F~': set = s!? is not Ct on V 
since it has no derivative at s = 0. 

One might suspect that our definition contains some redundant require- 
ments as in fact it does—in two ways. First, as shown in Exercise 6, it would 
not be possible to have a diffeomorphism between open subspaces of 
Euclidean spaces of different dimensions; indeed a famous theorem of alge- 
braic topology (Brouwer’s invariance of domain) asserts that even a homeo- 
morphism between open subsets of R” and R", m n, is impossible. 
Secondly, in the example given above the derivative of F vanishes at t = 0, 
thus behaving atypically: If it vanished everywhere, then F could not be a 
homeomorphism of R to R and if it vanished at no point, then F~' would 
indeed be a differentiable map (please argue this through!). We can certainly 
see at once that if F: U + V isa homeomorphism and both F and F7 ' are of 
class C! at least, then DF(x) is nonsingular, that is, has nonvanishing deter- 
minant at each x € U; for F7! o F = I, the identity map of U to U and by the 
chain rule DI(x) = DF~ '(F(x))- DF(x). Since DI(x) is just the identity 
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matrix for every xe U, DF(x) is nonsingular and its determinant is never 
zero. This includes the assertion that, if F: R > R is a diffeomorphism, then 
its derivative can never be zero. The main theorem of this section will have 
as a consequence Corollary 6.7, which is the converse of this statement. 
Before proving it, however, we consider two examples of diffeomorphisms of 
R" to R". 


(6.1) Example Let F: R" — R" be the translation taking a = (at, ..., a") 
to b = (b',..., b"). Then F is given by 


F(x',..., x") = (x! + (b! — a'),..., x" + (b" — a")), 


or F(x) = x + (b — a). The coordinate functions f'(x) = x‘ + (b' — a') are 
analytic, and hence C”. The translation G(x) = x + (a — b) is F~! which is 
then also C” and since F, F~' are defined and continuous, F is a homeo- 
morphism. Thus F is a diffeomorphism. 


(6.2) Example Let F: R" > R” be a linear transformation 
F(x!,..., x") = ( N ee 2), 
\j=l j=i 


or, using matrix notation with x as ann x 1 (column matrix) and A = (qj), 
F(x) = Ax. Computation shows that DF(x) is the constant matrix A, 
DF(x) = A. If det A + 0, then A has an inverse B and the homogeneous 
linear transformation G(x) = Bx is F~'. On the other hand if det A = 0, 
then F is not one-to-one, in fact it maps at least a line through the origin 
onto the single point 0 = (0, 0, ..., 0). Obviously F is analytic and C” in 
either case, so that F is a diffeomorphism if and only if DF(x) = A is 
nonsingular. 


Diffeomorphism is an equivalence relation among the open subsets of R”. 
We have the following lemma, whose proof we leave as an exercise, which 
gives the transitivity property; symmetry and reflexivity are part of the 
definition. 


(6.3) Lemma Let U, V, W be open subsets of R", F: U > V, G: V > W 
mappings onto, and H = Geo F: U > W their composition. If any two of these 
maps is a diffeomorphism, then the third is also. 

We now state the main theorem of the section. 
(6.4) Theorem (Inverse Function Theorem) Let W be an open subset of 


R" and F: W > R” a C’ mapping, r = 1,2, ..., or 00. Ifae W and DF(a) is 
nonsingular, then there exists an open neighborhood U of a in W such that 
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V = F(U) is open and F: U >V is a C diffeomorphism. If xe U and 
y = F(x), then we have the following formula for the derivatives of F~’ at y: 


DF! (y) = (DF(x))"', 


the term on the right denoting the inverse matrix to DF(x). 


This is one of the two basic theorems of analysis on which all of the 
theory in this book depends; the other is the existence theorem for ordinary 
differential equations (Chapter IV). The proof used here depends on the 
following fixed point theorem; a variety of proofs may be found in advanced 
calculus books. 


(6.5) Theorem (Contracting Mapping Theorem) Let M be a complete 
metric space with metric d(x, y) and let T: M > M be a mapping of M into 
itself. Assume that there is a constant 4,0 < 4 < 1, such that for all x, ye M, 


d(T (x), T(y)) < 4 d(x, y). 
Then T has a unique fixed point a in M. 


Proof Applying T repeatedly we see that d(T"(x), T"(y)) < 4" d(x, y). 
In particular, if we choose arbitrarily x) € M and let x, = T”(Xo), then we 
assert that d(x,,X,;+m) < 4"K, K = 0, a constant independent of n, m. Using 
T"*™(xXo) = T"(T™(Xo)), we write d(X,,Xn+m) < 2" d(xo, T"(Xo)). By the 
triangle inequality 
(Xo, T™(Xo)) < d(xo , T(Xo)) + d(T (Xo), T?(x0)) 
+ + d(T" (xo), T™(xo)) 


1 
<(1+144? +: +4" ') dxo, T(xo)) < 1274: T (Xo)). 


This shows that we may take 
1 
K = poy do 5 T(Xo)) 
and proves the assertion. Thus {x,} is a Cauchy sequence and has a limit 
point a. Since T(x,) = x,,, obviously has the same limit we see that 
d(T (a), a) = lim d(T(x,), Xn) = lim d(x,41, Xn) = 0 


so T(a) = a and a isa fixed point of T. There could not be two fixed points 
a, b for then d(a, b) = d(T (a), T(b)) < A d(a, b) contradicting the fact that 
A<1. i 


Proof of Theorem 64 We shall organize the proof of the inverse func- 
tion theorem in several steps in order to make it somewhat easier to follow. 
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(i) We assume F(0) = 0 and DF(0) = I, the identity matrix. 


This may be done without loss of generality by virtue of Lemma 6.3 
combined with the use of Examples 6.1 and 6.2. Next we define the mapping 
G on the same domain by 

G(x) = x — F(x). 
Then, obviously, G(0) = 0 and DG(0) = 0. (Note: In the last equation the 
right-hand side is the 0 matrix.) 


(ii) There exists a real number r > Q such that DF is nonsingular on the 
closed ball B,,(0) < W and for x,, x, € B,(0) we have 


(+) |G(x1) — G(x2)l| < żl|x1 — all 
and 
(+*+) [xi = xl] < 2|F(x1) — F(x2)l- 


To verify these statements we choose r so that B,,(0) = W ; further so 
that det(DF(x)), which is a continuous function of x and not zero at 0, does 
not vanish on B,,(0); and finally so that the derivatives of the coordinate 
functions of G, all of which are zero at 0, are bounded in absolute value by 
1/2n on B,,(0). With these assumptions, x,,x2¢B,(0) implies 
|x, — X2|| < 2r and Theorem 2.2 with m = n gives (+). Inequality («*) re- 
sults from replacing G(x;) by x; — F(x;), i = 1, 2, in («) and using a standard 
property of norms: 


[x1 — F(x1) — x2 + F(x2)|| < žlxı — x2] 
by (*), but 
[xi = X2| = []F(1) — F(x2)] < Or - x2) + F(x1) — F(x2)|. 
Combining these gives (x+). As a consequence of (+) we obtain the following: 


(iii) If |x|] < r, then ||G(x)|| < r/2, that is, G(B,(0)) = B,/2(0). Moreover 
for es y€ B,/2(0) there exists a unique x € B,(0) such that F(x) = 


The first statement is immediately obtained from (+) by setting x, = x 
and x, = 0; the second uses Lemma 6.3. If ye B,/(0) and x e B,(0), then 


ly + G) < [yl + |G) sar + ar = r- 


Let a mapping T,: B,(0) > B,(0) be defined for ye B,,2(0) by T(x) = y + 
G(x). Then T,(x) = x if and only if y = x — G(x) or, equivalently, F(x) = y. 
However, inequality (*) in the form 


IT) — Bll = |G) — G) < ala — x2] 


PDF pdfFactory Pro www.fineprint.cn 


6 THE INVERSE FUNCTION THEOREM 45 


valid for x,, x2 € B,(0), implies that T,(x) is a contracting mapping of the 
compact set B,(0) into itself. Therefore by Theorem 6.5 there is a unique x 
such that y = F(x). Since this is valid for any ye B,,.(0), we see that F~' is 
defined on that set. In particular, F being continuous, U = F~ '(B,,(0)) is 
an open subset of B,(0). Let V = B,,2(0); since B,(0) = W we see that: 


(iv) F is a homeomorphism of the open set U c W onto the open set V. 


It remains only to prove continuity of F~ +, which is a consequence of the 
inequality (++). Whenever x,, x€ U, we have y; = F(x,) and y, = F(x2), 
and (+*+) becomes 


[F= y) — Fo"(y2)l] < 2y — yall, 
which implies that F~': V > U is continuous. 


(v) Let b = F(a) be in V. Then F ' is differentiable at b and DF”! (b) = 
[DF(a)]~', the matrix inverse to DF(a). 


Since F is of class C’, r > 1, on W it is differentiable on all of U, in 
particular, at a = F~'(b). Thus by definition 


F(x) — F(a) = DF(a)- (x — a) + ||x — allr(x, a), 


where r(x, a) > 0 as x > a. By (ii), DF(a) is nonsingular and we let A be its 
inverse matrix. Multiplying the above expression by A and using y = F(x), 
x = F~'(y), and a = F~ 1(b), and so on, we obtain 


A: (y — b) = F> (y) — F-'(b) 
+ |F") — F~ (b) A «(Fo (y), Fo '(b)} 
This, in turn, gives 
F—!(y) = F~} (b) + A: (y — b) + lly — Bly, b) 
if we suppose y # b and define 


_ FQ) — FT) 
ly — Bj 
Inequality (++) shows that the initial fraction is bounded by 2, A is a matrix 


of constants, and F~'(y) is continuous so it is clear that lim, .,7(y, b) = 0 
which proves the differentiability of F7 + at any be V and shows that 


iy, b) = 


A+ (F~ (y), F"(b)). 


DF- !(b) = A = [DF(a)]~* 
as claimed. The following statement completes the proof. 


(vi) If F is of class C" on U, then F~* is of class C’ on V. 
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For ye V we have just seen that 
DF- (y) = [DF(F (y) *- 


Since F~‘(y) is continuous as a function of y on V and its range is U, since 
DF is of class C’~! and nonsingular on U, and since, finally, the entries in the 
inverse of a nonsingular matrix are C” functions of the entries of the matrix, 
it follows that DF ~' is continuous on V, thus F~! is of class C? at least. In fact 
if F`! is of class k < r, the entries of DF~ + are of class k — 1 at least on V, but 
the formula above for them shows these entries to be given by composition of 
functions of class C* or greater and hence to be of class C* at least. This 
implies F~! is of class C**'; so by induction F~* is of class C”. This 
completes the proof. I 


The following two corollaries are immediate consequences of 
Theorem 6.4. We use the notation of the theorem, that is, W is an open 
subset of R” and F: W > R". 


(6.6) Corollary If DF is nonsingular at every point of W, then F is an open 
mapping of W, that is, it carries W and open subsets of R” contained in W to 
open subsets of R". 


(6.7) Corollary A necessary and sufficient condition for the C® map F to be 
a diffeomorphism from W to F(W) is that it be one-to-one and DF be nonsingu- 
lar at every point of W. 


Exercises 


1. Carry the proof of Theorem 6.4 through in detail for the case r = 1, 
making any simplifications you can. 

2. Prove Lemma 6.3. 

3. Compute the Jacobian matrix for Examples 6.1 and 6.2. 

4. Show that for transformations on a compact subset K of R" which 
satisfy the conditions of the contracting mapping theorem except that 
0 < å < 1 (we allow å = 1), there still exists a fixed point. [Hint: Con- 
sider mappings T, = ((n — 1)/n)T.] 

5. Prove Corollary 6.6. 

6. Prove Corollary 6.7. 

7. Prove that there does not exist a C! diffeomorphism from an open 
subset of R” to an open subset of R” if m < n. 


7 The Rank of a Mapping 


In linear algebra the rank of an m x n matrix A is defined in three 
Bae ways: (i) the dimension of the subspace of V" spanned by the 
prt t 2 Caesia of the subspace of V” spanned by the columns, and 
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(iii) the maximum order of any nonvanishing minor determinant. We see at 
once from (i) and (ii) that the rank A < m, n. The rank of a linear transfor- 
mation is defined to be the dimension « of the image, and one proves that 
this is the rank of any matrix which represents the transformation. From 
this it follows that, if P and Q are nonsingular matrices, then 
rank(PAQ) = rank(A). 

When F: U —> R” is a C! mapping of an open set U c R”, then 
DF(x) has a rank at each xe U. Because the value of a determinant is a 
continuous function of its entries, we see from (iii) that if rank DF(a) = 
then for some open neighborhood V of a, rank DF(x) > k; and, if k = 
inf(m, n), then rank DF(x) = k on this V. In general, the inequality is possible: 


F(x, x?) = ((x!)? + (x?)?, 2x!x?) 


has Jacobian 


1 2 
DF(x', x?) = fs 2x ). 


2x? 2x! 


whose rank is 2 on all of R? except the lines x? = +x!. The rank is 1 on 
these lines except at (0, 0) where it is zero. 

We shall refer to the rank of DF(x) as the rank of F at x. If we compose F 
with diffeomorphisms, then the facts cited and the chain rule imply that the 
rank of the composition is the rank of F, since diffeomorphisms have nonsin- 
gular Jacobians. We say F has rank k on a set A, if it has rank k for each 
xe A. We use these definitions in stating the following basic theorem. 


(7.1) Theorem (Rank Theorem) Let Ao =c R", Bọ = R" be open sets, 
F: Ao > Bo be a C" mapping, and suppose the rank of F on Ag to be equal to k. 
If ae Ag and b = F(a), then there exist open sets A c Ag and B c Bo with 
ae A and be B, and there exist C" diffeomorphisms G: A > U (open) = R", 
H: B= V (open) c R” such that H © F > G~‘(U) c V and such that this map 
has the simple form 


H o F o G7 (ys xX”) = (x}, ..., x*, 0, ..., 0). 


Before proceeding to the proof we make some general comments. This is 
clearly an important theorem for it tells us that a mapping of constant rank k 
behaves locally like projection of R" = R* x R"~* to R* followed by injec- 
tion of R* onto R* x {0} c R* x R™-* = R”. This is an important tool and 
we shall use it frequently; later it will be rephrased in terms of local coordin- 
ates (cf. Chapter HI, Remark 4.2). It implies Theorem 6.4 as a special case. 


Proof To begin with, we may suppose a = 0, the origin of R”, and 
b = 0, the origin of R”. If the theorem holds for this case, then it may be 
seen to hold in yt since composition of F with two translations 
F(u + a) — b, which has the property that F(0) = 0. By similar 
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suppose that a k x k minor of nonzero determinant in DF(a) is 


of? ef* 

oul uk 
Ce ae . 
Cs) ; i 

of* 7 of* 

bul uk m4 


the upper left k x k minor. 
We define the C’-mapping G: Ay > R” by 


CO ..., u") = (f (ut, ..., co EW i CIA tt} 
Then 


of aft 
ĉu! ðu" 
DG=[ ` : * 
ĉu! ĝu” 
0 Tn-k 


where J,,_, is the (n — k) x (n — k) identity, the terms in the lower left block 
are zero, and those in the upper right do not interest us. This matrix is 
nonsingular at u = a, hence there is in Ag an open set A, containing a on 
which G is a diffeomorphism onto an open subset U, = G(A,). From the 
expression for G and the definition of U, we have Fe G '(0)=0, 
FoG~'(U,) < Bo, and 
F o G4 cg ca x") = (x1, SSP Weal") 

with f**4(x) = f**/ o G7 !(x). We may verify this by remembering that G~' 
is one-to-one on U, and for! = 1,...,k the values of x' are f'(u), ue G~*(U,), 
sof!o G7 !(x) = f'(u) = x'. So far we have used only the fact that the rank of 
DF at a (hence in a neighborhood of a) is at least k. We have not used the 
fact that it is identically k on Ay, but we need this in the next step which 
requires that the rank be at most k. We compute D(F o G~ ') from the for- 
mula above for F > G~’, giving 


i, 0 
of*+! oft? 
D(F o G')(x) = Oxi" ôx" 
* : 
gm g” 
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This is valid on U,, where F o G~! is defined. On the other hand, DG~' is 
nonsingular on U, and G~'(U,) = A, © Ao. Therefore, 


rank D(F o G7!) = rank(DF ° DG~') =k 


on U,, which implies that all terms in the lower right-hand block of the 
matrix are zero on U}, that is, the functions f‘*',...,f" depend on 
xt, ..., x* only. 

Now we define a function T from a neighborhood V, of 0 in R” into 
Bo c R" by the formula 


TY VP ee TeV VV FT Wee) 
ee J” + JOS 3 3") 
The domain V, is subject to two restrictions, first it must be small enough so 
that for y = (y',..., y")e V, the functions f**4(y', ..., y*) are defined and 


second, small enough so that T(V,) = Bo. It is clear that T(0) = 0. If we 
compute DT, we see that it is nonsingular everywhere on V, since it takes the 


form 
DT(y) = (" x ) 


* Ink 


Therefore T is a C” diffeomorphism of a neighborhood V of 0 in V, onto an 
open set B c R”; the origin of R” is in Band B c B,. Choose a neighbor- 
hood U c U, of the origin in R” such that F o G~'(U) c B;let A = G~1(U) 
and let H = T~?. Then 


G1 F H 
U — A — B —— V 


are C" maps of these open sets and G~', H are C' diffeomorphisms onto A 
and V, respectively. Finally we see that 


H o F o C7 Gt ..., x5, 5+1, a x") = Oe HO, ER” 


since 


On the other hand, according to its definition above, T must take the value 
(xt, aaa, XE JEE H, at eel oe) if we set y! = xi for i= 1, 
...,k and yf = 0 for i = k + 1, ..., m. Because T is one-to-one, it follows 


h -t tak xl., Be JEt Hx, x) e, IE ci X 
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(7.2) Corollary We may choose the neighborhoods U and V in either of the 
following ways: (i) U = Bt(0) and V = BM(0) or (ii) U = C:(0) and 
V = C0) with the same £ > 0 for both U and V. Then if n denotes the 
projection of R” = R* x R"~* to R* andi: R* > R* x R" “is the injection to 
the subset R* x {0}, we have n œ H o Fe G`! o iis the identity on B;(0) in case 
(i) or on C}(0) in case (ii). 


Exercises 


1. Let H” c R" be defined by H” = {x | x" > 0} and let GH” = {x | x" = 0}, 
x" being the last coordinate of x = (x1,..., x"). We see that 0H" = R" '. 
Suppose that U, V are (relatively) open sets of H” and F: U > V, 
G: V > U are C maps and are inverses of one another. Show that if 
U' = Un éH", V' = V a ôH", we must have that F | U’ and G | V’ are 
one-to-one onto, inverses of each other, and C” maps. 

2. Prove Corollary 7.2. 


We say that m functions f ' (x), ..., f(x) of class C’ defined on an open subset U c R" are 
dependent if there exists a C' function F(y', ..., y”) on R” which does not vanish on any open 
subset but such that F( f'(x), ...,f"(x)) = 0 on U. 


3. Show that if f'(x),...,f"(x) are dependent, then the rank of 
alf! ..., f™)/A(x', ..., x") is less than m. Also as a partial converse show 
that if the rank is less than m and constant on U, then the functions are 
dependent locally. 


Notes 


The implicit function theorem, which is proved in many advanced calculus texts, is essen- 
tially equivalent to the inverse function theorem proved in the last section. For a proof of this 
latter theorem, which is not based on the contracting mapping theorem, see Spivak [1]. We have 
used the form of proof above, since this same principle may be applied to give a proof of the 
existence of solutions of systems of ordinary differential equations. All of these theorems are 
treated in a unified and very elegant way by Dieudonné [1], although there is a disadvantage for 
many readers in the fact that Banach space, rather than R” is used throughout. S. Lang [1] also 
presents a very good treatment along the same lines. A discussion of the contracting mapping 
theorem and a sketch of its use in proving the existence theorem for differential equations may 
be found in the work of Kaplansky [1]. 

Although many of the theorems found here are valid for C’ functions and mappings and for 
C? also, the latter is too restrictive for most of our needs (theorems like those of Section 5 do 
not hold), and C is not strong enough to make Lemma 4.3 hold, which means that the 
characterization of T,(R") given in Section 4 would have to be abandoned. For this reason, and 
since it is very convenient to know that we do not lose differentiability as a result of taking 
derivatives, i.e., the derivatives of a C” function are also C”, C® is the preferred differentiability 
class in much of differentiable manifold theory. We shall therefore consider functions and 
mappings of class C* almost exclusively hereafter. 
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In the first section we give a precise definition of aC” manifold of dimension n: a topologi- 
cal manifold together with a covering by compatible coordinate neighborhoods, that is, a 
covering such that a change of local coordinates is given by C* mappings in R". Several 
examples are worked out in detail, the most complicated being the Grassmann manifold of 
k-planes through the origin of R". 

In Sections 3 and 4 both C” functions and mappings of manifolds are defined in terms of 
local coordinates, as is the rank of a mapping, that is, the rank of the Jacobian (in local 
coordinates). This enables us to consider certain examples of mappings of maximum rank 
(immersions and imbeddings) and leads to the definition of submanifold and regular submani- 
fold (Section 5). We require the latter to be subspaces and to be defined locally by the vanishing 
of some of the coordinates of suitable local coordinates in the ambient space. 

In Section 6 the concept of Lie group is defined. These are groups which are C” manifolds 
such that the group operations are C® mappings. It is shown that some standard matrix groups 
are Lie groups, for example, the group of orthogonal n x n matrices. In the following section 
the action of a Lie group on a manifold is defined and discussed. The group of rigid motions of 
E" is an example. 

The chapter concludes with a discussion of the special case of the properly discontinuous 
action of a discrete Lie group on a manifold and a brief introduction to covering manifolds. 
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1 The Definition of a Differentiable Manifold 


As a preliminary to the definition of a differentiable manifold, we recall 
the definition of a topological manifold M of dimension n; it is a Hausdorff 
space with a countable basis of open sets and with the further property that 
each point has a neighborhood homeomorphic to an open subset of R”. Each 
pair U, ọ, where U is an open set of M and 9 is a homeomorphism of U to 
an open subset of R”, is called a coordinate neighborhood: to q € U we assign 
the n coordinates x' (q), ..., x"(q) of its image g(q) in R"—each x'(q) is a 
real-valued function on U, the ith coordinate function. If q lies also in a 
second coordinate neighborhood V, wọ, then it has coordinates y'(q), ws 
y"(q) in this neighborhood. Since and y are homeomorphisms, this defines 
a homeomorphism 


bog ':Q(UNV)>u(Un V), 


the domain and range being the two open subsets of R” which correspond to 
the points of U ^ V by the two coordinate maps ¢, w, respectively. In coor- 
dinates, y o g~' is given by continuous functions 


yt = BG ..., x"), i= 1,...,, 


giving the y-coordinates of each qe U ^ V in terms of its x-coordinates. 
Similarly gow! gives the inverse mapping which expresses the x- 
coordinates as functions of the y-coordinates 


xi egy ad"), i= l,...,n. 
The fact that o o Y~! and y ° ~ ' are homeomorphisms and are inverse to 
each other is equivalent to the continuity of h(x) and g'(y), ij = 1,....7 


together with the identities 


hi(g(y),... 90) Ey t= Lem 
and 
gi(h' (x), w+, h'(x)) = x, j=l, 


Thus every point of a topological manifold M lies in a very large collec- 
tion of coordinate neighborhoods, but whenever two neighborhoods overlap 
we have the formulas just given for change of coordinates. The basic idea 
that leads to differentiable manifolds is to try to select a family or subcollec- 
tion of neighborhoods so that the change of coordinates is always given by 
differentiable functions. 


(1.1) Definition We shall say that U, ọ and V,w are C”-compatible if 
U ^ V nonempty implies that the functions h'(x) and g/(y) giving the change 
of coordinates are C” ; this is equivalent to requiring ọ ° Y~ + and yo g ‘to 
be diffeomorphisms of the open subsets g(U ^ V) and Y(U a V) of R”. (See 
Fig. III.1.) 
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Figure II.1 


(1.2) Definition A differentiable or C” (or smooth) structure on a topologi- 
cal manifold M is a family Y = {U,, @,} of coordinate neighborhoods such 
that: 


(1) the U, cover M, 

(2) for any «,ß the neighborhoods U,,g, and U,,q@, are 
C”-compatible, 

(3) any coordinate neighborhood V,y compatible with every 
U,,9,€ & is itself in X. 


A C” manifold is a topological manifold together with a C”-differentiable 
structure. 


It is, of course, conceivable that for some topological manifold no such 
family of compatible coordinate neighborhoods can be singled out. It is also 
conceivable that, on the contrary, families can be chosen in a multiplicity of 
inequivalent ways so that two inequivalent C” manifolds have the same 
underlying topological manifold. These are basic but very difficult questions, 
and in fact, are matters of recent research (see the notes at the end of this 
chapter). What is important from our point of view is that we will be able to 
find an abundance of topological manifolds with at least one differentiable 
structure, thus an abundance of C” manifolds; so we may ignore these more 
difficult questions in what we do here. 
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Since there is no danger of confusion, we will often say simply “ mani- 
fold” for C” manifold; we may also sometimes say differentiable or smooth 
manifold. Moreover, “coordinate neighborhood ” will hereafter refer exclu- 
sively to the coordinate neighborhoods belonging to the differentiable struc- 
ture. Should we have occasion to consider a manifold without differentiable 
structure, we will say topological manifold and topological coordinate 
neighborhood. 


By requiring only that the change of coordinates be given by C” functions for r < oc, we 
could define C’-compatible coordinate neighborhoods and C’ manifolds, C° denoting a topolo- 
gical manifold. One can also require that the change of coordinates be C”, that is, real-analytic. 
We shall restrict ourselves almost exclusively to the C® case. 


Before proceeding we will prove the following proposition, which will 
make it easier to give examples of differentiable manifolds; it shows that (1) 
and (2) of Definition 1.2 are the essential properties defining a C® structure. 
Thus in examples we need only check the compatibility of a covering by 
neighborhoods. 


(1.3) Theorem Let M be a Hausdorff space with a countable basis of open 
sets. If { Vg, Wp} is a covering of M by C”-compatible coordinate neighborhoods, 
then there is a unique C® structure on M containing these coordinate 
neighborhoods. 


Proof We shall define the differentiable structure to be the collection % 
of all topological coordinate neighborhoods U, œ which are C*®-compatible 
with each and every one of those of the given collection {V,, yg}. This new 
collection naturally includes the V,, Yg and so property (1) of Definition 1.2 
is automatically satisfied. As to property (2), suppose U, g and U’, g’, Un 
U' + Ø, are in the collection we have defined. Then are they 
C®”-compatible? Since they are (topological) coordinate neighborhoods, the 
functions g’ > g@' and go g’! giving the change of coordinates are well- 
defined homeomorphisms on open subsets of R”, and we need only be sure 
that they are C”. Let x = ọ(p) be an arbitrary point of ọ(U ^ U’). Then 
peV, for one of the coordinate neighborhoods Vz, wW. Therefore 
W=V,.UcU' is an open set containing p, and g(W) is an open set 
containing x. We have g’og '='°oWs'obgeg * on o(W), but 
gow ‘and W,° @ | are C” since U, o and U’, g' are both C*-compatible 
with V, Wy. It follows that their composition g’ ° œ~? is C* on g(W); and 
since it is C” on a neighborhood of any point of its domain, it is C”. This 
proves everything except property (3), which is automatic: Any U, that is 
compatible with all of the coordinate neighborhoods in our collection cer- 
tainly has this property with respect to the subcollection {V,, Wg}, and is thus 
in the differentiable structure. 
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(1.4) Remark It is important to note that a coordinate neighborhood 
U, ọ depends on both the neighborhood U and the map g of U to R”. If we 
change either, then we have a different coordinate neighborhood. For exam- 
ple, if Vc U is an open subset, then V,@|V is a new coordinate 
neighborhood—technically at least—although the coordinates of pe V are 
the same as its coordinates in the original neighborhood. If pe U, we may 
always choose V so that (V) is an open ball B?(a), or cube C"(a), in R” with 
o(p) = a as center. We may further alter œ by composing it with a translation 
0: R" — R" taking (p) = a to (0,0,...,0). Then V, y with y = 0° ọ|¥V is a 
new coordinate system, pe V c U, w(p) = (0,0,...,0), and Y(V) = B}(0) or 
Cx(0). 


Using the theorem just proved, we give some preliminary examples of 
manifolds. 


(1.5) Example (The Euclidean plane) (See comments in Section I.2.) 
Once a unit of length is chosen, the Euclidean plane E? becomes a metric 
space. It is Hausdorff and has a countable basis of open sets; the choice of an 
origin and mutually perpendicular coordinate axes establishes a homeomor- 
phism (even an isometry) y: E? > R?. Thus we cover E? with a single 
coordinate neighborhood V, y with V = E? and w({V) = R?. It follows not 
only that Æ? is a topological manifold, but by Theorem 1.3, V, y determines 
a differentiable structure, so E? is a C® manifold. 

There are many other coordinate neighborhoods on E? which are 
C*-compatible with V, y, that is, which belong to the differentiable struc- 
ture determined by V, y. For example, we may choose another rectangular 
Cartesian coordinate system V’, Y’. Then it is shown in analytic geometry 
that the change of coordinates is given by linear, hence C” (even analytic!) 
functions 

yt = xt cos 0 — x? sinð +h, y?=x!'sin@+ x? cos 0 + k. 

Note that V = V’, but the coordinate neighborhoods are not the same 
since wy’ + y, that is, the coordinates of each point are different for the two 
mappings. 

It is also possible to choose as U the plane minus a ray extending froma 
point 0. Using the angle @(q) measured from this ray to 0g and the distance 
r(q) of q from 0 as coordinate functions on U we define a homeomorphism 
(4) = (r(q), 0(q)) from U to the open set {(r, 8) |r > 0,0 < 8 < 27} in R?. 
The equations for change of coordinates to those above, assuming that 0 is 
the origin and that the ray is the positive x-axis, are 

x! = r cos 0, x? =rsinð, 
which again are analytic, thus C”. If the origin and axes are not chosen in 
this special way, then we must compose this mapping on R? with a rotation 
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and translation of the type above to obtain the functions giving the change 
of coordinates. The various coordinate neighborhoods just enumerated 
being C”-compatible with our original V, y are in the differentiable struc- 
ture on E? determined by it. 

In the same manner Euclidean space of arbitrary dimension n gives an 
example of a C”? manifold, covered by a single coordinate system. Again, this 
may be done in a variety of ways. As we have noted it is customary to 
identify E" and R” since the former is difficult to axiomatize; this is equiva- 
lent to choosing a fixed rectangular Cartesian coordinate system covering all 
of E". Many examples will make it abundantly clear that manifolds in gen- 
eral can not be covered by a single coordinate system nor are there preferred 
coordinates. Thus it is often better in thinking of Euclidean space as a 
manifold to visualize the model Æ" of classical geometry—without 
coordinates—rather than R”, Euclidean space with coordinates. (However, 
we will later follow common practice and identify £" and R".) 

A finite-dimensional vector space V over R can be identified with 
R", n = dim V, once a basis e;,...,e, is chosen: v= x'e; + + x"e, is 
identified with (x', ..., x") in R”; similarly, the m x n matrices (a;;) with R™ 
with the matrix A = (a;;) corresponding to 


(aiiis lin eei ngisis dmn): 


Using these identification mappings we may define a natural topology and 
C® structure on V and on the set .%,,,(R) of m x n matrices over R. We 
suppose them to be homeomorphic to Cartesian or Euclidean space of 
dimension n in the case of V, and mn in the case of ./,,,(R) and covered by a 
single coordinate neighborhood, the identification map above being the 
coordinate map. 


(1.6) Example (Open submanifolds) An open subset U of a C™ mani- 
fold M is itself a C” manifold with differentiable structure consisting of the 
coordinate neighborhoods V’, y’ obtained by restriction of y, on those coor- 
dinate neighbohoods V, y which intersect U, to the open set V’ = V ^ U, 
that is, Y’ = Y |V ^ U. This gives a covering of U by topological coordinate 
neighborhoods which are C”-compatible, and hence defines a C” structure 
on U. which is said then to be an open submanifold of M. 

A particular case of some interest is the following. We consider the subset 
U = Gl(n, R) of M E” M (R), n x n matrices over R, which consists of all 
nonsingular n x n matrices. Since an n x n matrix A is nonsingular if and 
only if its determinant det A is not zero, we have 


U = {Ae.M,({R)| det A + 0}, 


which is the usual definition of the group Gi(n, R). Since det A is a polyno- 


mial function of its entries a,j, it is a continuous function of its entries and of 
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A in the topology of identification with R”. Thus U = Gl(n, R) is an open 
set—the complement of the closed set of those A such that det A = 0, and 
we see that Gl(n, R) is an open submanifold of .7,(R). 


(1.7) Theorem Let M and N be C” manifolds of dimensions m and n. Then 
M x N is a C” manifold of dimensionm + n with C* structure determined by 
coordinate neighborhoods of the form {U x V, @ x y}, where U, p and V,w 
are coordinate neighborhoods on M and N, respectively, and (p x W)(p, q) = 


(o(p). W(q)) in RE" = R" x RE 


The proof is left as an exercise. An important example is the torus 
T? = S! x S!, the product of two circles (see Fig. II.2). More generally, 
T” = S! x + x St, the n-fold product of circles is a C” manifold obtained 
as a Cartesian product. 


Figure IJ1.2 


(1.8) Example (The Sphere) We give a fairly detailed proof, using 
Theorem 1.3, that the unit 2-sphere S$? = {xe Rẹ? | ||x|| = 1} isa C* manifold 
(see Fig. III.3). The idea used is an elaboration of that discussed in 
Section I.3. It extends in an obvious way to S”~ t, the unit n — 1 sphere in R”. 
A somewhat simpler method, using stereographic projection, is left to the 
exercises; it also extends to S”~'. 

We take S? with its topology as a subspace of R?, that is, U is open in S? 
if U = Ŭ n S? for some open set U c R°. This implies that S? is Hausdorff 
with a countable basis; we shall show that it is locally Euclidean. For i = 1, 
2, or 3, let Uj = {(x!, x?, x°) |x? > 0} and U7 = {(x, x?, x°) | x! < 0}; 
these U? are two open sets into which the coordinate hyperplane x; = 0 
divides R?. The relatively open sets U* = U* ^ S?,i = 1, 2, 3, cover S*. We 
define of : U* — R? by projection: 
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Figure HI.3 


These are homeomorphisms to the open set W = {xe R? | ||x|| < 1} as is 
easily checked; thus S? is locally Euclidean and a topological manifold. 
However, the formulas for the change of coordinates are C”, and thus these 
coordinate neighborhoods are C”-compatible. For example, gf ° (p3) ' is 
given on Ut n Uz by composing (3) * and gf 


(x, x3) ie (x!) —(1 = (x4)? = (x3)2)12, x3) 


(xt, = (1 = (xt = ep), 08) Se (= (1 = (xt (2), x°). 


Then, by change of notation, using (u', u?) as Uz -coordinates and (v', v?) as 
U} -coordinates instead of (x', x?) and (x?, x°), we have 


o! = -(1- (w)? Wy, a. 


The v' are C* functions of the ut since the square root term is never 
zero on {(u',u?)|(u')? + (u?) < 1}. Similarly, gz e(gy) * is C” on 
{(v', v?)|(v!)? + (v?)? < 1}. Thus the coordinate neighborhoods Uy, t and 
Uz, @z are C*-compatible. This covering of S? by eight coordinate neigh- 
borhoods determines a C” structure. 


Thus S? is an example of a manifold which is a subset of another mani- 
fold, namely R?. Very many examples will be of this type as will be seen later; 
they are called submanifolds (to be defined). A two-dimensional submanifold 
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of E> or R? is often called a surface and a one-dimensional submanifold a 
curve; planes and spheres, circles and lines are the simplest examples. 
Classical differential geometry dealt extensively with these two cases. 

Coordinate neighborhoods are often called charts and a collection of 
compatible charts which cover a manifold M an atlas, a terminology 
suggested by charts or maps of the globe. An atlas thus determines a 
differentiable structure on M. A complete atlas is maximal, i.e., satisfies (3) of 
Definition 1.2. 


Exercises 


1. Prove Theorem 1.7. 

2. Using stereographic projection from the north pole N(0, 0, + 1) ofall of 
the standard unit sphere in R? except (0, 0, — 1) determine a coordinate 
neighborhood Uy, gx. In the same way determine by projection from 
the south pole S(0, 0, — 1) a neighborhood Ug, @, (see the accompany- 
ing figure). Show that these two neighborhoods determine a C* struc- 
ture on S?. Generalize to S"~!. 


x? 


3. Let Mm(R) be the space of all real m x n matrices and .@%,,(R) be the 
subset of all those m x n matrices whose rank is > k. Show that ./#+,,(R) 
is an open subset of ./,,,,(R) and thus a manifold. 

4. We say that two (C®) atlases on M are equivalent if their union is an 
atlas. Show that this is an equivalence relation and that each equivalence 
class contains a unique complete atlas. 

5. Let X beaset and {U,, y,} a covering by subsets U, each with a bijective 
map g,:U,—V,, an open subset of R,. If these charts are all C” 
compatible, i.e., all pge@,' are C”, does this data define a unique 
manifold topology—with p, '-images of open subsets of V, for all « as 
basis of open sets—and a corresponding C® structure on X? 
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2 Further Examples 


In this section we shall discuss two related examples of “ abstract” mani- 
folds, that is, manifolds which are not defined as submanifolds of Euclidean 
space. The first example is the space of classical (real) projective geometry, 
the second—technically more difficult to define than any example we have 
met thus far—is the Grassman manifold, consisting of all k-planes through 
the origin in R”, n > k; it will be treated more fully in Section IV.9. Both 
examples—in fact the latter includes the former—arise from equivalence 
relations defined on simpler manifolds, the underlying space of the new 
manifold being the set of equivalence classes with a suitable topology. 

Let X be a topological space and ~ an equivalence relation on X. 
Denote by [x] = {ye X | y ~ x} the equivalence class of x, and for a subset 
A c X, denote by [A] the set (_),. 4 [a], that is, all x equivalent to some 
element of A. We let X/~ stand for the set of equivalence classes and denote 
by n: X > X/~ the natural mapping (projection) taking each xe X to its 
equivalence class, n(x) = [x]. With these notations we define the standard 
quotient topology on X/~ as follows: U < X/~ is an open subset if n~ '(U) 
is open; the projection z is then continuous. 


(2.1) Definition With the above notation and topology we shall call X/~ 
the quotient space of X relative to the relation ~. 


As a simple example let X = R the real numbers and let Z be the 
integers. We define x ~ y if x — ye Z and denote by R/~ the quotient 
space. We shall leave as an exercise the proof that this quotient space may be 
naturally identified with S' = {ze C| |z| = 1}, the unit circle in the com- 
plex plane, and that z: R > R/~ is then identified with the map z(t) = 
exp(2zt,./—1). Note that X/~ is a space of cosets of a group relative to a 
subgroup; this situation occurs frequently. 


(2.2) Definition An equivalence relation ~ ona space X is called open if 
whenever a subset A c X is open, then [A] is also open. 


Our examples will usually be open equivalence relations. The following 
lemma will show why. 


(2.3) Lemma An equivalence relation ~ on X is open if and only if n is an 
open mapping. When ~ is open and X has a countable basis of open sets, then 
X/~ has a countable basis also. 


Proof Let A c X bean open subset. Since [A] = n~ '(n(A)), we see by 
definition of the quotient topology on X/~ that [A] is open if z is open and 
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conversely [A] open implies 2(A) is open. Now suppose ~ is open and X has 
a countable basis {U;} of open sets. If W is an open subset of X/~, then 
ee = a jej U; for some apa of {U} and W = x(x '(W)) = 
rex @(U,). It follows that {z(U;)} is a basis of open sets for X/~. 


This lemma is clearly useful in determining those equivalence relations 
on a manifold M whose quotient space is again a manifold, for a manifold 
must be a Hausdorff space with a countable basis of open scts. Unfor- 
tunately, there is no simple condition which will assure that the quotient 
space is Hausdorff. In fact, as Exercise 2 shows, a quotient space X/~ may 
be locally Euclidean with a countable basis of open sets and still fail to be 
Hausdorff. Nevertheless we obtain important examples by this method, 
sometimes with the assistance of the following lemma. 


(2.4) Lemma Let ~ be an open equivalence relation on a topological space 
X. Then R = {(x, y)|x ~ y} isa closed subset of the space X x X if and only 
if the quotient space X/~ is Hausdorff. 


Proof Suppose X/~ is Hausdorff and suppose (x, y)¢ R, that is, x 4 y. 
Then there are disjoint neighborhoods U of x(x) and V of z(y). We denote 
by U and V the open sets x~'(U) and x ‘(V), which contain x and y, 
respectively. If the open set U x V containing (x, y) intersects R, then it 
must contain a point (x’, y’) for which x’ ~ y’, so that 2(x’) = 2(y’) contrary 
to the assumption that U ^ V = Ø. This contradiction shows that U x V 
does not intersect R and that R is closed. 

Conversely, suppose that R is closed, then given any distinct pair of 
points x(x), x(y) in X/~, there is an open set of the form U x V containing 
(x, y) and having no point in R. It follows that U = x(U) and V = x(V) are 
disjoint. Lemma 2.3 and the hypothesis imply that U and V are open. Thus 
X/~ is Hausdorff. | 


(2.5) Example (Real projective space P"(R)) We let X = R"*' — {0}, all 
(n + 1)-tuples of real numbers x = (x+, ..., x"*') except 0 = (0,..., 0), and 
define x ~ y if there is a real number t # 0 such that y = tx, that is, 


(Yi, ces Dead) = (tx,,..., (X41). 


The equivalence classes [x] may be visualized as lines through the origin 
(Fig.III.4). We denote the quotient space by P”(R); it is called real projective 
space. We prove: 


P"(R) is a differentiable manifold of dimension n. 


To do so we first note that x: X > P"(R) is an open mapping. If t + 0 is a 
real number, let g,: X + X be the mapping defined by ọ,(x) = tx. It is 
clearly a homeomorphism with o7 + = @,),. If U c X is an open set, then 
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Figure II.4 


[U] = |] o{U), the union being over all real t + 0. Since each ¢,(U) is open, 
[U] is open and z is open by Lemma 2.3. 

Next we apply Lemma 2.4 to prove that P"(R) is Hausdorff. On the open 
submanifold X x X c R"*! x R"*! we define a real-valued function 


F(x, y) by 
f(x, a rts yt Ao yo") = X (xiy ia xiy’). 
iFj 
Then f(x, y) is continuous and vanishes if and only if y = tx for some real 
number t + 0, that is, if and only if x ~ y. Thus 


R = {(x, y)|x~ y} =f *(0) 


is a closed subset of X x X and P"(R) is Hausdorff. 

We define n + 1 coordinate neighborhoods U,, g;,i = 1,...,n + 1, as 
follows: Let U; = {xe X | x' #0! and U; = 2(U,). Then ọ;: U; > R" is 
defined by choosing any x = (xt, ..., x"*") representing [x] € U; and putting 


x! xi xit? x"ti 
gx) = eo xi ingi eee xi . 


It is seen that if x ~ y, then g(x) = gy); moreover ox) = ọ;(y) implies 
x~ y. Thus g;: U; > R" is properly defined, continuous, one-to-one, and 
even onto. For ze R", g; '(z) is given by composing a C” map of R" to R"*? 


with z, namely, g; ‘(z',...,2") = x(z',...,2'-*, +1, 24,..., 2"); therefore 
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; ` is continuous. Thus P"(R) is a (topological) manifold and is C= if the 
coordinate neighborhoods are C*-compatible, that is, p; ° @ z | is C7 (where 
defined) for 1 < i, j < n + 1. The verification is simple and explicit and is 
left to the reader. This completes the proof that P”(R) is a manifold. 


(2.6) Example (Grassman manifolds G(k, n)) The Grassman manifold 
G(k, n) is the set of all k-planes through the origin of R"- or k-dimensional 
subspaces of V" = R" (as a vector space)—endowed with a suitable topology 
and differentiable structure. We will realize G(k, n) as a quotient space aris- 
ing from an equivalence relation on the manifold F(k, n) of k-frames in R”, 
where we define a k-frame in R" to be a linearly independent set x of k 
elements of R": 


Xe = (xf, ..., XP). 


A k-frame in R” may be identified with the k x n matrix, which we also 
denote by x, whose rows are x4, ..., x,. We use the fact that the set AM ,(R) 
of all k x n real matrices is a differentiable manifold by virtue of its 
identification with R*". The matrices which correspond to k-frames, that is, 
those of rank k, form an open subset and hence F (k, n) is a differentiable 
manifold. This is because of the fact that “x is of rank k” means that the 
following two equivalent statements hold: (i) the row vectors form a linearly 
independent set and (ii) not all k x k minor determinants are zero simultan- 
eously. Statement (ii) shows that the rank is less than k at the simultaneous 
zeros of a set of continuous functions on M ,,(R), that is, on a closed subset, 
so F(k, n) is open. 

Clearly each frame x determines a k-plane or point of G(k, n), namely, the 


subspace spanned by x,,..., Xp, So that we have a natural map of F(k, n) 
onto G(k, n). Moreover x = (x,,..., x,) and y = (y,,..., Yk) determine the 
same k-plane if and only if y; = }¥_4 4,;x;, where a = (x;;) is a nonsingular 


k x k matrix, that is, if and only if y = ax, the product of the matrices a and 
x. It is natural to define ~ by 


y~x if y=ax, aeGi(k, R). 


We then identify G(k, n) with F(k, n)/~, the set of equivalence classes, and 
the above mentioned natural map with z. We sketch a proof that G(k, n) 
with the quotient space topology has the structure of a differentiable mani- 
fold of dimension k(n — k). A different proof will be given in Section IV.9. 
Note that if k = 1, then ae G(1, R) = R* and G(k, n) becomes P"~ !(R). The 
proof that x is an open mapping is analogous to Example 2.5 and is left to 
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the reader. The proof that G(k, n) is Hausdorff is trickier, but is also left as an 
exercise. 

It remains to describe a covering by coordinate neighborhoods with 
C*-compatible coordinate maps so that Theorem 1.3 may be applied to 
complete the proof. We shall use the k x k submatrices of x€ M, (R) to 
accomplish this. Let J = (j;,.--,j,) be an ordered subset of (1,..., n), for 
example, J = (1,2,...,k) and J’ be the complementary subset [for the 
example then, J’ = (k + 1, ..., n)}. By x, we denote the k x k submatrix 
(xi), 1 <i, l <k, of the k x n matrix x, and by x,, we denote the com- 
plementary k x (n — k) submatrix obtained by striking out the columns 
jı- J, Of x. Let U, be the open set in F(k, n), consisting of matrices for 
which x, is nonsingular and let U, = n(Ŭ,) be the corresponding open set in 
G(k, n). Each ye U, is equivalent to exactly one k x n matrix x in which the 
submatrix x, is the k x k identity matrix; for example, if J = (1, 2,..., k), 
then x is of the form 


Low 0 Xiks U Xin 
0> 0 

x= . . 
Ov L Xkupi U Xen 


(In fact the x equivalent to a matrix y for which y, is nonsingular is given by 
the matrix formula x = y; ‘y.) 

We define o; : U; > Mun (R), identified with R*"-®, by deleting the k 
columns corresponding to J in this representative x of y, thus @,([y]) = x, 
(the matrix comprising the last n — k columns, in the example above). We 
leave it as an exercise to show that @, is properly defined and maps U; onto 
R**-®) homeomorphically and that the U,, g; for all subsets J of k distinct 
elements of (1, 2, ..., n), form a covering of G(k, n) by C”-compatible coor- 
dinate neighborhoods; a verification of this for G(2, 4), the 2-planes through 
the origin of R4, is sufficient to show how to proceed in general. As men- 
tioned, a different proof will be given later. 


Exercises 


1. Prove the statements after Definition 2.1 concerning R/~, S+, and the 
mapping z: R > R/~. Show that z is an open mapping. 

2. Let X¥ =Rx {+1} with the Cartesian topology and C® structure. 
Define an equivalence relation on X by (x, i) ~ (y, j) if either (i) x = y < 0 
or (ii) x = y and i = j. Show that X/~ is locally Euclidean and has a 
countable basis of open sets but is not Hausdorff (cf. Exercise 1.3.1). 
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3. Let a: S" > S" be the map of the unit sphere in R"*' taking each x to its 
antipodal point a(x) = — x. Show that x ~ y if y = x or y = a(x) is an 
equivalence relation and that S”/~ is naturally identified with P"(R). 

4. Give a careful definition in terms of equivalence relations of the cutting 
and pasting (identification) procedures used to obtain examples of 
topological manifolds in Section 1.4. 

5. Use the ideas of Exercise 4 to show that P?(R) can be obtained by 
defining a suitable equivalence relation on a square. 

6. Let X, Y be topological spaces, ~ an equivalence relation on X and F: 
X — Y a continuous mapping. Suppose that x; ~ x, implies F(x,) = 
F(x ) and show that F induces a continuous mapping F’: X/~ — Y such 
that F’on = F. 

7. Show that z: F(k,n) > G(k, n) is an open mapping and that G(k, n) is 

Hausdorff. [Hint: show that x ~ y if and only if a certain collection of 

(k + 1) x (k + 1) minor determinants of the 2k x n matrix whose rows 

are Xy,..., Xk; Y, ---> Yk all vanish and apply Lemma 2.4.] 

In the case G(2, 4) complete the proof that this space is a C” manifold. 

9. Let J,,...,Jy, N = (2), be the collection of distinct subsets (1, 2, ..., n) 
containing k integers and for each xe F(k, n), . Define 
a mapping ®: F(k, n) > P*(R) by (x) = [(|x,,|.---» |Xsx|)] and 
show x ~ y implies ®(x) = ®(y) so that ® defines a mapping of G(k, n) 
into P*(R). Show that this mapping is continuous and univalent. Use 
this to prove that G(k, n) is Hausdorff. 


ge 


3 Differentiable Functions and Mappings 


On a topological space the concept of continuity has meaning; in an 
analogous way, ona C” manifold we may define the concept of C% function. 
Let f be a real-valued function defined on an open set W, of a C” 
manifold M, possibly all of M; in brief, f: W, > R. If U, ọ is a coordinate 


neighborhood such that W, ^ U + Ø and if x’, ..., x" denotes the local 
coordinates, then f corresponds to a function Î (x! p o (W; ^ U) 
defined by f= fc g', that is, so that f (p) =F (x"(p (p)) = flop )) 


for all pe W, OU. We will customarily omit the aS a use the same 
letter “ f ” for f as defined on Ws and for f, its expression in local coordin- 
ates. Ordinarily this will result in no confusion; if two coordinate neighbor- 
hoods U, and V,w are involved, we will use different letters for the 
coordinates, say x',..., x" and y’,..., y". Thus for pe W; ^ U ^ V we have, 
omitting carets, 


p) = f(x (p), -~ x"(P)) = S(x (p), «+> y"(P)), 


the latter two f’s denoting f’s, or fo g~' and fo Y~}, respectively, the ex- 
pressions in local coordinates. 


PDF pdfFactory Pro www.fineprint.cn 


(3.1) Definition Using the notation above, f: W, > R is a C” function if 
each peW, lies in a coordinate neighborhood U,@ such that 
fog *(xt,..., x") = f(x}, ..., x") is C” on o(W; o U). [Clearly, a C* func- 
tion is continuous. ] 


Among the C% functions on M are the n-coordinate functions 
(x1(q), -.-, x"(q)) of a coordinate neighborhood U, œ. More precisely, if 
nè: R" > R is defined by z'(x',..., x") = x‘, these functions are defined by 
x'(q) = n’ - (q), and their expression in local coordinates, on (U), by 

PO ng eS MO xt, ag) HT (Se xi. 
As mentioned above, the caret is usually omitted so we have the statement 
x'(x!,..., x") = xÍ, i= 1,..., n,—somewhat confusing since the same letter 
is used for a function and its values. 

It is a consequence of the definition that if fis C® on W and V c W isan 
open set, then f | V is C” on V. Moreover, if W is a union of open sets on 
each of which a real-valued function fis C* , then fis C” on W. Using the C” 
compatibility of coordinate neighborhoods, it is easily verified that if fis C” 
on Wand V, y is any coordinate neighborhood intersecting W, then fo y~! 
is C” on the open set y(V œ W) in R”. 

Just as in the case of R” we proceed from definition of C® function to 
definition of C® mapping. Suppose that M and N are C” manifolds, 
W c M is an open subset, and F: W —> N is a mapping, then we make the 
following definition. 


(3.2) Definition F is a C® mapping of W into N if for every pe W there 
exist coordinate neighborhoods U, @ of p and V, y of F(p) with F(U) e V 
such that po Fe g +: p(U) > W(V) is C* in the sense of Section II.2. 


More precisely, this means that F | U: U > V may be written in local 
coordinates x',..., x" and y',..., y” as a mapping from @(U) into y(V) by 
BE sce SF pate ec peg), 


[or simply y' = f‘(x), i = 1,..., m] and each f‘(x) is C” on @(U). Note that 
C® mapping is a more general notion than C” function, the latter being a 
mapping to N = R, which is, of course, the same as R’. 


(3.3) Remark It is important to note that C® mappings are continuous; 
that their restrictions to open subsets are C” ; and that any mapping from an 
open subset W c N into M, whose restriction to each of a collection of 
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open sets which cover W is C”, is necessarily C” on W. As with 
Definition 3.1, the C® compatibility of local coordinate neighborhoods, 
Corollary II.2.4, and the remarks above show easily that the property does 
not depend on any particular choice of coordinates. Similarly it follows from 
the same corollary that composition of C” mappings is again a C® mapping. 

Many authors refer to C” manifolds, functions, and mappings as smooth. 
From now on we shall refer to differentiable manifold, function and mapping 
although this is not very logical since we previously (in Chapter IT) used this 
word in a much weaker sense than C®. One reason that C® is a desirable 
differentiability class to use is that when we later take derivatives of C” 
functions on manifolds, we obtain C” functions—in the C” case we would 
obtain C’~! functions. Thus assuming infinite differentiability relieves us of 
many irritating concerns about order of differentiability. Of course, the same 
would be true for C° (real-analytic), but this is too restrictive for most 
purposes since we are unable to obtain important theorems of the following 


type. 


(3.4) Theorem Let F be a closed subset and K a compact subset of a C” 
manifold M with F à K = Ø. Then there is a C® function f defined on M 
with values in [0, 1] which has the value +1 on K and 0 on F. 


Proof The proof of this theorem and that of the following corollary 
require a slight modification of Theorem II.5.1. This is left to the reader as 
an exercise. i 


(3.5) Corollary Let U be an open subset of a manifold M, suppose pe U, 
and let f be a C® function on U. Then there is a neighborhood V of p in U and a 
C® function f* on M such that f* = f on V and f* = 0 outside of U. 


We conclude this section with a definition, an example, and some re- 
marks on a basic problem referred to in Section 1. 


(3.6) Definition A C” mapping F: M— WN between C” manifolds is a 
diffeomorphism if it is a homeomorphism and F~'! is C”. M and N are 
diffeomorphic if there exists a diffeomorphism F: M > N. 


This extends the concept of diffeomorphism, previously defined for open 
subsets of R” only, to arbitrary C” manifolds. Diffeomorphism of manifolds 
is an equivalence relation since composition of C” maps is C” and composi- 
tion of homeomorphisms is a homeomorphism. From this transitivity fol- 
lows; reflexivity and symmetry are obvious from the definition. It is 
important that F~', as well as F, be C® as the following example shows. 
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(3.7) Example Let F: R > R be defined by F(t) = 1°. Then F is C” anda 
homeomorphism, but it is not a diffeomorphism since F~ '(t) = t’/° and this 
is not even of class C!—let alone C’—at t = 0. This same example shows 
how it is possible to define two distinct C” structures on R. The first is the 
usual one defined by letting U = Rand g: U > R be the identity map; this 
determines a C® structure on R by Theorem 1.3. We may also consider the 
structure defined by the coordinate neighborhood V, y with V = R and 
Y:V >R defined by y(t) = tè. Then gow *(t) = tt? so that U, p and 
V, y are not C®-compatible and hence not in the same differentiable struc- 
ture. However, R with its first structure—the usual one—is diffeomorphic to 
R, denoting R with its second structure, the diffeomorphism F: R > R being 
defined by F(t) = t° so that in local coordinates it is given by 


poFog ‘(t)=t. 


We have just seen, then, that two C® manifolds with the same underlying 
topological manifold but incompatible C® structures can still be diffeo- 
morphic. A fundamental question is: Can the same manifold M or homeo- 
morphic manifolds have C® structures which are not diffeomorphic? This 
was an unsolved problem for many years, and it was finally settled by 
Milnor [4] who proved the existence of two C® structures on S 7 which were 
not diffeomorphic. Further details are given in the notes. 

We conclude with a remark which is occasionally useful: A necessary 
and sufficient condition that an open set U of M, together with a mapping 
ọ: U > R", be a coordinate neighborhood is that g be a diffeomorphism of 
U onto an open subset W of R”. Conversely, if W is an open subset of R” and 
yw: W > M is a diffeomorphism onto an open subset U, then U, y~' isa 
coordinate neighborhood. We sometimes call W, y a parametrization, 
especially in the case dim M = 1. 


Exercises 


1. Show that a continuous mapping F: M > N, C” manifolds, is C” ifand 
only if for any C® function f on an open set W, c N the function f° F is 
Ce. 

2. Verify the statements of Remark 3.3. 

Prove the statement of the concluding paragraph of this section. 

4. Assume that K is a compact subset of a C”? manifold M which is 
contained in an open subset U. Prove that there is a C® function f: 
M > [0, 1] which is identically 1 on K and whose support (closure of the 
set on which f # 0) is contained in U. 

5. Let M, N, and A be C” manifolds and p,:Mx N >M, 
p2.M x N >N be projections to the factors. For (a, b)eM x N, let 
i: M >M x N be defined by i(p) = (p, b) andj: N > M x N by j(q) = 


w 
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(a, q). Show that p,, pz, i, and j are C” mappings. Show that a mapping 
F:A>M x N is C” if and only iff, = p, ° F and f, = p, ° F are C”. 

6. Suppose M and N are C” manifolds, U an open set of M, and F: U > N 
is C”. Show that there exists a neighborhood V of any peU, V c U, 
such that F can be extended to a C® mapping F*: M > N with F(q) = 
F*(q) for all qe V. 

7. Identify the set of k-frames F(k, n) of R” with the set ofk x n matrices of 
rank k in .@,,(R) and let z: F(k,n) > G(k, n) be the mapping taking 
each such matrix to its equivalence class (see Example 2.6). Show that 
this map is C” relative to the differentiable structure of F(k, n) as an 
open submanifold of ./,,,(R). 

8. Let A, B, M, N be C” manifolds and let F: A > M and G: B > N be C” 
mappings. Show that FxG:AxB->MxN is C”, where 


(F x G)(x, y) = (F(x), Gy). 


4 Rank of a Mapping. Immersions 


Let F: N >M be a differentiable mapping of C® manifolds and let 
peN. If U, g and V, y are coordinate neighborhoods of p and F(p), respec- 
tively, and F(U) c V, then we have a corresponding expression for F in local 
coordinates, namely, 


Ê = Y o Fog ':9(U) > WV). 


(4.1) Definition The rank of F at p is defined to be the rank of F at (p) (as 
in Section II.8). 


Thus the rank at p is the rank at a = (p) of the Jacobian matrix 


of”. Ff 
éx! Ox" 
oy 
ax} ex" | 4% 


of the mapping F(x',..., x") = (f1(x1,..., x")... £(x!, ..., x")) expres- 
sing F in the local coordinates. This definition must be validated by showing 
that the rank is independent of the choice of coordinates. This is left as an 
exercise—a second definition which is clearly independent of the choice is 
given in the next chapter. 


(4.2) Remark As might be conjectured, the important case for us will be 
that in which the rank is constant. In fact Theorem II.7.1 (theorem on rank) 
and its Corollary II.7.2 can be restated as follows: 
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Let F: N —> M be as above and suppose dim N = n, dim M = mand rank 

= k at every point of N. If pe N, then there exist coordinate roe 

v ọ and V, W as above such that o(p) = (0, ..., 0), Y(F(p)) = (0 0) and 
= woFog ' is given by 


FAS 6p x") = (x1, ..., x*, 0, ..., 0). 


Moreover we may assume (U) = C2(0) and (V) = Cr (0) with the same 
e>0. 


An obvious corollary to this remark is: a necessary condition for 
F: N > M to be a diffeomorphism is that dim M = dim N = rank F. Oth- 
erwise k would be either less than n or less than m, in which case the 
expression in local coordinates implies that it is not possible for both F and 
F`! to be one-to-one, even locally. For example, if k < n in the expression 
above, all points in U with coordinates of the form (0, ..., 0, Be rete he) 
are mapped onto the same point of V. 


(4.3) Definition F:N — M is said to be an immersion (submersion) if rank 
F =n = dim N(=m = dim M) everywhere. If F is an injective (one-to-one) 
immersion, then F establishes a one-to-one correspondence of N and the 
subset N = F(N) of M. If we use this correspondence to endow N with a 
topology and C® structure, then N will be called a submanifold (or immersed 
submanifold) and F: N > Ñ is a diffeomorphism. 


We note that rank F < max(m, n) at every point. It follows that if F is an 
immersion (submersion), then n < m (respectively, n > m). In general the 
topology and C® structure of an immersed submanifold Ñ c M depend on F 
as well as N, i.e., Ñ is not a subspace of M. The concept of submanifold will be 
fully discussed below after considering a number of examples of immersions 
F of R or an open interval R into R". 

We use the natural coordinates t on R and (xt, x”) on R? to define F. To 
verify that F is an immersion it is necessary to check that the Jacobian has 
rank 1 at every point, that is, that one of the derivatives with respect to t 
differs from zero for every value of t for which the mapping F is defined; this is 
left to the reader. 


(4.4) Example F:R> R°? is given by F(t) = (cos 27t, sin at, t). The 
image F(R) is a helix lying on a unit cylinder whose axis is the x?-axis in R? 


(Fig. III.5a). 


(4.5) Example F: R >R? is given by F(t) = (cos 2zt, sin 2xt). The image 
F(R) is the unit circle $1 = {(x!, x?)|(x')? + (x?) = 1} in R? (Fig. IILS5b). 


PDF pdfFactory Pro www.fineprint.cn 


4 RANK OF A MAPPING. IMMERSIONS 71 


/ 


(a) (b) 


Figure II.5 


(4.6) Example F: (1, 0) —> R? is given by 
F(t) = ((1/t) cos 2zt, (1/t) sin 27t). 


The image is a curve spiraling to (0,0) as t > œ and tending to (1, 0) as 
t > 1 (Fig. III.6a). 


(4.7) Example F: (1, 00) > R?, as in the previous example. However, F is 
modified so that the image F(R) spirals toward the circle with center at (0, 0) 
and radius 3 as t > 0. The mapping is given by 

t+1 t+1 


F(t) = (= cos 27t, 


sin 2m); 


[It is not difficult to check that the Jacobian could have rank 0, that is, both 
derivatives dx'/dt and dx?/dt could vanish simultaneously on 1 < t < œ if 
and only if cot 2xt = —tan 2zt, which is impossible (Fig. III.6b).] 


{a) (b) 


Figure II.6 
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(4.8) Example F:R — R? is given by 
F(t) = (2 cos(t — 4), sin 2(t — 37)). 


The image is a “ figure eight” traversed in the sense shown (Fig. III.7a) with 


OD So 


(a) (b) 


Figure III.7 


the image point making a complete circuit starting at the origin as t goes 
from 0 to 2z. 


(4.9) Example G:R — R? again and the image is the “figure eight” as in 
the previous example, but with an important difference: we pass through 
(0, 0) only once, when t = 3. For t > — œ and t > +0 we only approach 
(0, 0) as limit—as shown in Fig. III.7b. The immersion is given by changing 
parameter in the previous example: Let g(t) be a monotone increasing C” 
function on —%œ <t< œ such that g(0)= m, lim,- g(t)=0 and 
lim,.4 g(t) = 2x. For example, we may use g(t) = x + 2 tan~' t. Then 
G(t) is given by composition of g(t) with F(t) from the previous example: 


G(t) = F(g(t)) = (2 cos bo = a sin afa) a 5) 
(4.10) Example Again F: R > R? so that 


1 
(_-sin z) for l<t<o, 
F(t) = (\t 


(0, t + 2) for -œo <t<-l. 


This gives a curve with a gap as shown in Fig. I1I.8. For —1 < t < +1 we 
connect the two pieces together smoothly as shown by the dotted line. This 
gives a C” immersion of all of R in R? whose image is as shown. As we shall 
see, this is a useful example to keep in mind. 
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(0, 0) 


(O, -1) 


Figure II.8 


We may draw some conclusions from these examples about the nature of 
immersions. First we note that an immersion need not be univalent, that is, 
one-to-one into (injective), in the large, even though it is one-to-one locally 
as we see from Remark 4.2. Examples 4.5 and 4.8 show this since, for exam- 
ple, in both cases t = 0, +22, +4n, ... all have the same image point: (0, 1) 
in the case of the circle and (0, 0) for the figure eight. 

The second conclusion we can draw is that even when it is one-to-one, an 
immersion is not necessarily a homeomorphism onto its image, that is, 
F: N > M a one-to-one immersion does not imply that F is a homeomor- 
phism of N onto N = F(N) considered as a subspace of M. Examples 4.9 and 
4.10 show this: in the case of Example 4.9, N is the figure eight whereas N is 
the real line R—two spaces which are not homeomorphic. In the case of 
Example 4.10, N is again the real line and Ñ = F(N) as a subspace of R? is 
not locally connected at all of its points: there are points on the x*-axis such 
as (0, 4), which do not have arbitarily small connected neighborhoods; hence 
N and N = R are not homeomorphic. In any case, of course, F: N > M is 
continuous—since it is differentiable. These examples lead to the definition 
of a more restrictive concept. 


(4.11) Definition An imbedding is a one-to-one immersion F: N > M 
which is a homeomorphism of N into M, that is, F is a homeomorphism of 
N onto its image, N = F(N), with its topology as a subspace of M. The 
image of an imbedding is called an imbedded submanifold. 


We remark that Examples 4.4, 4.6, and 4.7 are imbeddings. The following 
theorem, essentially a restatement of the theorem on rank and its corollary 
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along the lines of a remark above, show that the distinction between immer- 
sions and imbeddings is a global one—it does not depend on the nature of F 
locally. 


(4.12) Theorem Let F:N > M be an immersion. Then each peN has a 
neighborhood U such that F | U is an imbedding of U in M. 


Proof According to Remark 4.2, we may choose cubical coordinate 
neighborhoods U, o and V, w of pe N m $ (p)e M, respectively, such that 
o(p) = (0, ..., 0) in R”, Y(F(p)) = (0 ) in R” with g(U) = C:(0) and 
W(V) = C7 (0) (cubes of the same breadth ‘ and such that F = y œ F o o`t, 
the expression of F in these local coordinates, is given by 


Ê(x!,..., x") = (x1, ..., x", 0, cs 0). 


To see that F |U is a homeomorphism of U onto F(U) with the relative 
topology, it is enough to see that F is a homeomorphism of C"(0) onto its 
image in C7'(0). This is because F(U) < V, an open subset of M, so the 
topology of F(U) as a subspace of M is the same as its topology as a 
subspace of V, and because g: U > C2(0) and y: V > C7(0) are homeomor- 
phisms. But it is clear that F is a homeomorphism of C"(0) onto the subset 
xt la ++. = x™ = 0 of C7(0); hence the theorem holds. I 


(4.13) Remark It is convenient to call a subset S of a cube C”(a) in R” 
an n-slice if it consists of all points for which m — n coordinates are held 
constant. For example, S = {x e C7(0)|x"*! =--- = x" = 0} is a slice 
through the center 0 = (0,..., 0) of C7(0). If V, y is a cubical coordinate 
neighborhood on a manifold M and S’ is a subset of V such that y(S') is a 
slice S of the cube YV), then S’ is called a slice of V. It is an imbedded 
submanifold of M with imbedding y~ 1: S > M. 

We note for future use that in the proof of Theorem 4.12, S’ = F(U) isa 
slice of V. In general this slice is not equal to the set V ^ F(N) but only 
contained in it, even if F is univalent and U is chosen very small. The reader 
should verify this using the preceding examples. 


Exercises 


1. Using Exercise 11.2.4, show that the rank of a mapping of C” manifolds is 
independent of the choices of local coordinates made in Definition 4.1. 

2. Show that a submersion is an open mapping and that if both manifolds 
have the same dimension, it is locally—but not necessarily globally—a 
diffeomorphism onto its image. 

3. On the “figure eight” image N of R in Example 4.9 let the topology and 
C® structure be given by the one-to-one immersion G. Show that the 
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reflection H: (xt, x?) > (xt, — x?) of R? in the x! axis, although it maps Ñ 
onto Ñ, is not a diffeomorphism of Ñ. 

4. Let F: N;—> M, i= 1, 2 be one-to-one immersions. Define them to be 
equivalent if there is a diffeomorphism G:N, > N, such that F, = 
F, ° G. Show that this is an equivalence relation (see Warner [1, Section 
1.33]). 

5. Show by example that there exist two inequivalent one-to-one immer- 
sions of R into R* with the same image. 

6. Show that composition of immersions is an immersion. 

7. Show that the map F: S"! — P""'(R), defined by F(x',..., x") = 
[x',..., x"], is C” and everywhere of rank n — 1. 

8. Let F:M >B be a C” mapping of manifolds and let A be an (im- 
mersed) submanifold of M. Show that F | A is a C® mapping into B. 


5 Submanifolds 


In this section we shall discuss in some detail the various types of 
submanifold. This term is used in more than one sense in the literature; 
however all agree that a submanifold N of a differentiable manifold M is a 
subset which is itself a differentiable manifold. The confusion arises over the 
question of whether or not it should be required to be a subspace of M, that 
is, to have the relative topology. We have adopted the definition which 
is the most general, namely, a submanifold N is the image in M of a 
one-to-one immersion F: N' > M, N = F(N’), of a manifold N’ into M 
together with the topology and C® structure which makes F: N’ > N a diffeo- 
morphism. This definition, particularly convenient for Lie groups, became 
widely used following the influential Chevalley [1]. Its disadvantages are 
clear from Examples 4.9 and 4.10 and Exercises 4.3-4.5. A more natural 
notion—which we shall now develop, is that of a regular submanifold; as its 
name implies, it will be a special case of the one above. It is more natural since 
its topology and differentiable structure are derived directly and unambig- 
uously from that of M. We will first state the characteristic feature of those 
subsets of a differentiable manifold M which are regular submanifolds; to do 
so we suppose m = dim M and that n is an integer, 0 < n < m. 


(5.1) Definition A subset N of a C” manifold M is said to have the 
n-submanifold property if each pe N has a coordinate neighborhood U, on 
M with local coordinates x',..., x" such that (i) (p) = (0,..., 0), (ii) 
g(U) = C70), and (iii) p(U ^ N) = {xe C70) |x tt =- = x" = 0}. 1f N 
has this property, coordinate neighborhoods of this type are called preferred 
coordinates (relative to N). Figure III.9 shows such a subset N in M = R? 
(n=2and m = 3). 
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Note that immersed submanifolds do not always have this property, for 
example, take p = (0,0) in Examples 4.9 and 4.10, and take note of the second 
paragraph of Remark 4.13. 

Denote by n: R” > R", n < m, the projection to the first n coordinates, 
then we may state the following lemma, using the notation above. 


M =R? 


Figure III.9 


(5.2) Lemma Let N c M have the n-submanifold property. Then N with 
the relative topology is a topological n manifold and each preferred coordinate 
system U, o of M (relative to N) defines a local coordinate neighborhood V, @ 
on N by V=UQAN and 6 = n° |V. These local coordinates on N are 
C®-compatible wherever they overlap and determine a C” structure on N 
relative to which the inclusion i: N > M is an imbedding. 


Proof Assume N has the subspace topology relative to M. Then V, @ 
are topological coordinate neighborhoods covering N; for V = U ^ Nisan 
open set in the relative topology and @ is a homeomorphism onto C;(0) = 
n(C7(0)) in R". Suppose that for two preferred neighborhoods, U, œ and 
U'’,g', V= UN and V’ = U'AN have nonempty intersection. Since 
V, ọ and V’, 9’ are topological coordinate neighborhoods, we know that the 
change of coordinates is given by homeomorphisms 6’ o @ ' and @ o (Y ', 
which we must show to be C®. Let 6: R" > R” be given by O(x’,..., x") = 
(xt, ..., x", 0,..., 0) so that z o 8 is the identity on R”. This map 0 is C® as is 
its restriction to C"(0), an open subset of R"; thus @~' = ~ * o 8 is C” since 
it is a composition of C” maps. On the other hand @’ = r ° g’, and because 
g’ isa C” map of U’ and its open subset U’ ^ U to R” and v is open and C™, 
we see that 9’ is C® on VV’. Thus 6'°@~* is C” on its domain, 
OV OV’). 
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It is even easier to see this if we write the expressions in local coordinates. 
If y' = fi(xt,..., x"), i= 1,...,m are the functions giving g' ° ~+, which 
we know to be C?”, then it is easily checked that @’° @~' is given by 
yi =f'ix',...,27,0,...,0) i=1,...,0. Therefore op") is C” by 
Definition 3.2. 

By Theorem 1.3 of this chapter, the totality of these neighborhoods 
determines a differentiable structure on N. In preferred local coordinates 
V, @, i: N > M is given on V by (x!,..., x") > (xt, ..., x", 0, ..., 0), so it is 
obviously an immersion. Because we have taken the relative topology on N, 
i: N > M is by definition a homeomorphism to its image i(N) = N, with the 
subspace topology, that is, i is an imbedding. 


The foregoing completes the proof of Lemma 5.2 and allows us to make 
the following important definition. 


(5.3) Definition A regular submanifold of a C” manifold M is any sub- 
space N with the submanifold property and with the C® structure that the 
corresponding preferred coordinate neighborhoods determine on it. (Note: N 
is trivially a submanifold, as previously defined, since i is an imbedding.) 


As an example we shall see that S? = {xe R*||/x|| = 1} is really a 
submanifold, as was indicated at the end of Section 1. If q = (a+, a”, a*) is an 
arbitrary point on S?, it cannot lie on more than one coordinate axis. For 
convenience we suppose that it does not lie on the x?-axis. We introduce the 
usual spherical coordinates (r, 0, g); they are defined on R? — {x°-axis} and if 
(1,80, Po) are the coordinates of q we may change the coordinate map 
slightly so that r is replaced by * =r — 1, 0 by ð = 0 — bo, and ọ by 
ỌÖ =  — po. Then for sufficiently small ¢, the neighborhood V, y with 
coordinate function y: p — (F(p), (p), @(p)) defined for p such that |F | <e, 
|6| <e, and |@| <e defines a coordinate neighborhood of q, with q 
having coordinates (0, 0,0) and with V ^ S? the open subset of $? corre- 
sponding to # = 0. The fact that these neighborhoods are compatible with 
the ones previously defined for S* (Example 1.8) can be proved by writing 
down the standard equations giving rectangular Cartesian coordinates as 
functions of the spherical coordinates. 


(5.4) Remark At this point we have apparently defined three classes of 
submanifolds in a manifold M—immersed, imbedded, and regular. The first of 
these, which we usually call simply a submanifold, includes the last two as 
special cases. It was defined (4.3) as the image N = F(N’) of a C” manifold N’ 
under a univalent immersion F, and was by definition endowed with the C” 
manifold structure of N’. This means simply that we use the one-to-one onto 
correspondence F: N’-+ N to topologize N, and define the C® structure 
on N in terms of coordinate neighborhoods U, », where U = F(U’) and 
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o =(F|N)"}o@’, using all coordinate neighborhoods U’, g’ of the C” 
structure of N’. A priori there is little relation between the topology and 
differentiable structure thus given to the subset N of M and that of M. 
However, the fact that F is continuous implies that the topology of N 
obtained in this way is in general finer than its relative topology as a subspace 
of M, that is, if V is open in M, then V n N is open in N, but there may well be 
open sets of N which are not of this form. 

An imbedding on the other hand is a particular type of univalent 
immersion, one in which U’ is open in N’ if and only if F(U) = VON for 
some open set V of M so that the topology of the submanifold N = F(N’) is 
exactly its relative topology as a subspace of M. An imbedded submanifold is 
thus a special type of (immersed) submanifold. 

Finally, a regular submanifold is an imbedded submanifold with the 
inclusion i as imbedding map, and, conversely, as the next theorem shows, the 
submanifold structure on N = F(N’) given by any imbedding F is that of a 
regular submanifold. Hence we have (i) (immersed) submanifolds and (ii) 
their subclass imbedded or regular submanifolds, the latter being the most 
natural and most important. Perfectly legitimate alternative definitions 
would be as follows: let N be a subset of a C° manifold M which is 
(somehow) independently given the topology and C® structure of a manifold. 
Then N is a submanifold if for this structure the inclusion i: N > M is an 
immersion and a regular submanifold if i is an imbedding, i.e., its given 
topology is the relative topology as a subspace of M. Moreover, 


(5.5) Theorem Let F: N'— M be an imbedding of a C” manifold N of 
dimension n in a C® manifold M of dimension m. Then N = F(N’) has the 
n-submanifold property and thus N is a regular submanifold. As such it is 
diffeomorphic to N’ with respect to the mapping F: N' > N. 


Proof Let q = F(p) be any point of N. As in Theorem 4.12, we choose 
cubical coordinate neighborhoods U, @ of p and V, Y of q such that (i) 
o(p) = (0,...,0)€ C20) = (U), Gi) Yq) = (0,...,0)€ CO) = WV), and 
(iii) the mapping F|U is given in local coordinates by F:(x',..., x") 
(x!,...,x",0,...,0). If F(U) = V ON, then the neighborhood V, y would be 
a preferred coordinate neighborhood relative to N. We may show this to hold 
for restrictions U’ c U and V’ c V. Since F is an imbedding, F(U) is a 
relatively open set of N, that is, F(U) = W ^ N, where W is open in M, and 
since V > F(U), it is no loss of generality to suppose W c V. Thus y(W) is an 
open subset of C™(0) containing the origin and y(W) > W(F(U)), which is a 
slice S of C™(0), S = {xe C%(0)|x"*! =--- = x” = 0}. Therefore we may 
choose a (smaller) open cube C7%(0) < ¥(W) and let V’ = y~ *(CF(0)), Y = 
yI V’. This is a cubical coordinate neighborhood of q for which F(U) a V’ = 
V'aN; moreover, taking U’ =~ '(C(0))=F~*(V’), we see that 
U’, g’, with g’ = o| U’, is a coordinate neighborhood of p and the pair U’, g’, 
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and V’, w’ have exactly the properties needed, namely, (i), (ii), (iii) and 
F(U’) = V’ Aa N. This proves simultaneously that N has the n-submanifold 
property and that F is a diffeomorphism. The latter is true since the inverse of 
F: N' > N is given in the local preferred coordinates V’, zo y’ and U’, g’ by 
F-4001,..., x") = (xt, ..., x"), which is clearly C°”. | 


(5.6) Remark The following addendum to Remark 4.13 and to the preced- 
ing comments is often useful. Suppose that N c M is an (immersed) sub- 
manifold and that q€ N. Then there is a cubical neighborhood V, w of q with 
yla) = (0, ..., 0)€ C7(0) = W(V) such that the slice S’ c V, consisting of all 
points of V whose last m — n coordinates vanish, is an open set and a cubical 
coordinate neighborhood U, of the submanifold structure of N with U = S’ 
and g(r) = n o y(r) = x'(r),..., x"(r)). Thus the coordinates on U are just the 
restrictions to U of the first n coordinates of V, y. The proof is left as an 
exercise. 


We now have a definition of regular submanifold, and we wish to obtain 
examples—which are useful for many reasons, among which is that they give 
further interesting examples of manifolds. Since it is usually easier to deter- 
mine that a map from one C” manifold into another is an immersion than to 
see that it is an imbedding, the following theorem is useful. A generalization 
is given in Exercise 1. 


(5.7) Theorem If F: N — M is a one-to-one immersion and N is compact, 
then F is an imbedding and N = F(N) a regular submanifold. Thus a sub- 
manifold of M, if compact in M, is regular. 


Proof Since F is continuous and both N and N—with the subspace 
topology—are Hausdorff, we have a continuous (one-to-one) mapping from 
a compact space to a Hausdorff space. Since a closed subset K of N is 
compact, F(K) is compact and therefore closed. Thus F takes closed subsets 
of N to closed subsets of N, and being one-to-one onto it takes open subsets 
to open subsets also. It follows that F~' is continuous so F: N > Ñ is a 
homeomorphism and therefore an imbedding. The rest of the statement 
follows from our remarks above. i 


The most useful method of finding examples of submanifolds is given by 
the following theorem and its corollary. Since many examples of manifolds, 
as we have seen, occur as submanifolds of some other manifold, especially 
Euclidean space, the corollary is also very helpful in proving that some of the 
objects we have looked at are indeed C® manifolds. Examples are given 
below. 


(5.8) Theorem Let N be a C® manifold of dimension n, M be a C® manifold 
of dimension m, and F: N —> M be a C® mapping. Suppose that F has constant 


PDF pdfFactory Pro www.fineprint.cn 


80 ill DIFFERENTIABLE MANIFOLDS AND SUBMANIFOLDS 


rank k on N and that q € F(N). Then F~*(q) is a closed, regular submanifold 
of N of dimension n — k. 


Proof Let A denote F~ '(q); A is a closed subset since the inverse image 
of {q}, a closed subset of M, under a continuous map is closed. We shall show 
that A has the submanifold property for the dimension n — k. Let pe A; 
since F has constant rank k on a neighborhood of p, by the theorem on rank 
(Remark 4.2) we may find coordinate neighborhoods U, g and V, of p and 
q, respectively, such that (p) and y/(q) are the origins in R” and R”, ọ(U) = 
C70), w(V) = C70), and in local coordinate (x',..., x") and (y',..., y”), 
F | U is given by the mapping 


po Fop Hx) = F(x',..., x") = (x',...,240,..., O). 


This means that the only points of U mapping onto q are those whose first k 
coordinates are zero, that is, 


ANU =9 (oF 'oy"0)) 


= p-(F-3(0)) = g7 Hx e C20) |x! = ++ = x* = 0}. 
Hence A is a regular manifold of dimension n — k since it has the sub- 
manifold property. i 


(5.9) Corollary If F:N —>M is a C® mapping of manifolds, dim M = 
m < n = dim N, and if the rank of F = m at every point of A = F~! (a), then 
A is a closed, regular submanifold of N. 


The corollary holds because at pe A, F has the maximum rank possible, 
namely m. It follows from Section II.7 and the independence of rank on local 
coordinates that, in some neighborhood of p in N, F has this rank also; thus 
the rank of F is mon an open subset of N containing A. But such a subset is 
itself a manifold of dimension n—an open submanifold-—to which we may 
apply the theorem. 


The first two applications of Corollary 5.9 are just very simple ways of 
demonstrating that S"! and the torus, described, as in Example 1.3.4, by 
rotating a circle around a line in its plane which does not intersect it, are 
both manifolds. Other applications will be made in the next section. 


(5.10) Example The map F:R"—>R defined by F(x’,...,x")= 
yt, (x)? has rank 1 on R" — {0}, which contains F~'(+1) = S"~*. Thus 


i=1 


S"~? is an (n — 1)-dimensional submanifold of R” by Corollary 5.9. 


(5.11) Example The map F: R? = R given by 
F(x?, x?, x3) = (a — (x!)? + PA + (x3)? 
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has rank | at each point of F~'!(b?), a > b > 0. Thus the locus F~ '(b7), the 
torus in R°, Fig. 1.2b, is a submanifold. 


Exercises 


1. Let F:N —> M bea one-to-one immersion which is proper, i.e., the inverse 
image of any compact set is compact. Show that F is an imbedding and 
that its image is a closed regular submanifold of M and conversely. 

2. Show that the mappings i: M > M x N andj: N>M x N defined in 
Exercise 3.5 are imbeddings. 

3. Let Ac Mand B c N be submanifolds of M and N. Show that A x B 
is (by inclusion) a submanifold of M x N. Show that if A and B are 
regular, so is A x B. 

4. Prove the statement of Remark 5.6. 

5. If N is a submanifold of M and V is an open subset of M, possibly 
connected, then show that N a V is the union of a countable collection of 
connected open subsets of N (with its submanifold topology). 

6. Show by example that there may be functions that are C® on a 
submanifold N of M that cannot be obtained by restriction of a C” 
function on M. 

7. Let S? = {xe R°|||x|| = 1} and obtain P?(R) by indentification of antipo- 
dal points of S?. Define F: R? > R* by F(x, x?, x3) = ((x')? — (x?), 
x!x?, x1x3, x?x3). Since F(—x) = F(x), F|S? defines a mapping 
F: P?(R) > R*. Show that F is a C® imbedding (cf. Hilbert and Cohn- 
Vossen [1]). 


6 Lie Groups 


The space R” is a C” manifold and at the same time an Abelian group 
with group operation given by componentwise addition. Moreover the alge- 
braic and differentiable structures are related: (x, y) > x + y is a C” map- 
ping of the product manifold R” x R” onto R", that is, the group operation is 
differentiable. We also see that the mapping of R” onto R” given by taking 
each element x to its inverse — x is differentiable. 

Now let G be a group which is at the same time a differentiable manifold. 
For x, ye G let xy denote their product and x‘ the inverse of x. 


(6.1) Definition G is a Lie group provided that the mapping of 
G x G >G defined by (x, y) > xy and the mapping of G —> G defined by 
x > x7! are both C® mappings. 


(6.2) Example Gi(n, R), the set of nonsingular n x n matrices, is as we 
have seen, an open submanifold of .4,(R), the set of n x n real matrices 
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identified with R"’. Moreover Gi(n, R) is a group with respect to matrix 
multiplication. In fact, an n x n matrix A is nonsingular if and only if 
det A # 0; but det(AB) = (det A)(det B) so if A and B are nonsingular, AB 
is also. Ann x n matrix A is nonsingular, that is, det A # 0, if and only if A 
has a multiplicative inverse; thus Gi(n, R) is a group. Both the maps 
(A, B) > AB and A > A“! are C®. The product has entries which are poly- 
nomials in the entries of A and B, and these entries are exactly the expres- 
sions in local coordinates of the product map, which is thus C®, hence C®. 
The inverse of A = (a;;) may be written as A~* = (1/det A)(4,;), where the 
a;, are the cofactors of A (thus polynomials in the entries of A) and where 
det A is a polynomial in these entries which does not vanish on Gi(n, R). It 
follows that the entries of A~' are rational functions on Gi(n, R) with non- 
vanishing denominators, hence C® (and C%). Therefore Gl(n, R) is a Lie 
group. A special case is GI(1, R) = R*, the multiplicative group of nonzero 
real numbers. 


(6.3) Example Let C* be the nonzero complex numbers. Then C* is a 
group with respect to multiplication of complex numbers, the inverse being 
z7! = 1/z. Moreover C* is a one-dimensional C” manifold covered by a 
single coordinate neighborhood U = C* with coordinate map z > ¢(z) 
given by v(x + iy) = (x, y) for z = x + iy. Using these coordinates, the 
product w = zz’, z = x + iy, and z’ = x’ + iy’ is given by 


((x, yx, y)) > (xx — yy’, xy’ + yx’) 
and the mapping z > z~! by 


(x, y) -( z => ) 


xy Hy? 


This means that the two mappings of Definition 6.1 are C” ; therefore C* is a 
Lie group. 


(6.4) Theorem If G, and G, are Lie groups, then the direct product 
G, x G, of these groups with the C” structure of the Cartesian product of 
manifolds is a Lie group. 


The proof is left as an exercise. 


(6.5) Example (The toral groups) The circle S' may be identified with 
the complex numbers of absolute value + 1. Since |z; z3| = |z; | |z2|, it is 
a group with respect to multiplication of complex numbers—a subgroup of 
C*. It is a Lie group as can be checked directly or proved as a consequence 
of Example 6.3 and the next theorem. Combining this with Theorem 6.4, we 
see that T" = S! x --- x St, the n-fold Cartesian product, is a Lie group. It 
is called the toral group. Since S* is abelian, T” is Abelian also. 
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As might be expected, the subgroups—in the sense of algebra—of a Lie 
group which are also submanifolds play a special role. 


(6.6) Theorem Let G be a Lie group and H a subgroup which is also a 
regular submanifold. Then with its differentiable structure as a submanifold H 
is a Lie group. 


The proof requires the following lemma, which is important in its own 
right. 


(6.7) Lemma Let F: A >M be a C” mapping of C” manifolds and sup- 
pose F(A) < N, N being a regular submanifold of M. Then F is C” as a 
mapping into N. 


Proof Since N is a regular submanifold of M, each point is contained in 
a preferred coordinate neighborhood. Let pe A, let q = F(p) be its image, 
and let U, ọ be a neighborhood of p which maps into a preferred coordinate 
neighborhood V, y of q. We have y(V) = C7(0) with (q) = (0,..., 0), the 
origin of R”, m = dim M; and V ^ N consists of those points of V whose 
last m — n coordinates are zero, n = dim N. Let (x',..., x?) be the local 
coordinates in U, œ on A. Then the expression in local coordinates for F is 


Peg ET (ictal Oi 0, -.-, 0), 


that is, f"**(x) = + = f(x) = 0 since F(A) = N. 
However, V ^A N, n o W |y an is a coordinate neighborhood of q on N, so 
F, considered as a mapping into N, is given in local coordinates by 


(xt, x") > (SE), a E) 


This is Ê followed by the projection z to the first n coordinates (projection of 
R” to R"), which is a composition of C” maps and is therefore C®. 


(6.8) Remark Lemma 6.7 does not hold for general submanifolds! In 
Example 4.9 if we map the open interval (—1,1) by a mapping G into 
N = F(R), the figure eight, so that it crosses the origin as shown in 
Fig.III.10, then G is C” as a mapping into R?, but not even continuous as a 
mapping to N. Thus N is diffeomorphic to the real line by F: R > N, and 
identifying N and R, we may think of G as taking part of the open interval 
(—1, 1), say (0, 1), onto the real numbers t > 1, 0 onto 0, and (— 1, 0), the 
remaining part, onto the real numbers t < — 1. The image is not even con- 
nected, so G is not continuous. But this lemma can be generalized to all 
submanifolds by adding a continuity condition: see Exercise 5 (also Exercise 
7 for an important consequence). 
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-1 M = R? 
Figure 111.10 


Proof of the Theorem It follows without difficulty that H x H is a 
regular submanifold of G x G. Thus the inclusion map F,: H x H>GxG 
is a C? imbedding. If F,:G x G—G is the C” mapping (g, g') > gg’ and 
F = F, » F, the composition, then F is a C” mapping from H x H > G with 
image in H. Let F denote this map considered as a map into H; it is not the 
same mapping as F since we have changed the range. We must show that Fis 
C® and similarly that the map H — G given by taking h > h~ ' is C” as a map 
onto H. These facts both follow from the lemma, which completes the proof. 


We may use Theorem 6.6 and Theorem 5.8 to give many further 
examples of Lie groups. To do so we make use of the following naturally 
defined maps of a Lie group G onto itself: (i) x > x~ +, (ii) left and right 
translations by a fixed element a of G, that is, La, R,: G > G defined by 
L,(x) = ax and R,(x) = xa. These maps are C® by definition of Lie group 
and have inverses which are C”, so they are, in fact, diffeomorphisms. The 
mapping of (i) is its own inverse and we have (L,)>* = L,-ı and 
(Ra)? = R,-1- 


(6.9) Example Si(n, R) = {X e Gl(n, R)| det X = +1} is a subgroup and 
regular submanifold of Gi(n, R), hence a Lie group. To prove this, we con- 
sider the mapping F: Gi(n, R) > R*, F(X) = det X. According to the pro- 
duct rule, det(X Y) = (det X)(det Y). Thus F is a homomorphism onto 
R* = GI(1, R); it is also C® since it is given by polynomials in the entries. 
Finally, its rank is constant: Let Ae Gl(n, R); let a = det A; and let Ly, Ly 
denote left translations in Gi(n, R) and Gi(1, R) = R*. Then 


F(X) = L, ° Fe Ly-:(X) 


since we have a-det(A~'X)=det X. Applying the chain rule, 
Section II.2.3, using DL, = a + 0, and using the fact that L,-: is a diffeo- 
morphism (so that DL,-; is nonsingular), we have 


rank DF(X) = rank[aDF(A~'X)DL,-.(X)] = rank DF(A~*X) 
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for all AeGi(n,R). In particular, rank DF(X) = rank DF(X~'X) = 
rank DF(I), and thus we see that the rank is constant as claimed. It follows 
that Si(n, R) = F~*'(+1) is a closed, regular submanifold by Theorem 5.8. It 
is also a subgroup—in fact the kernel of a homomorphism—by virtue of the 
product rule for determinants; therefore it is a Lie group. 


(6.10) Example O(n) = {X € Gi(n, R)|'X X = I}, the subgroup of ortho- 
gonal n x n matrices is a regular submanifold and thus a Lie group. Let 
F(X) = 'XX, 'X = transpose of X, define a mapping from Gl(n, R) to 
Gl(n, R). If Ae Gl(n, R), we will show that rank DF(X) = rank DF(XA~'); 
and since any Y e Gl{n, R) can be written in the form Y = XA’, it follows 
that rank DF is constant on Gi(n, R). To obtain this equality we note that 
F(XA~') = Lig-1° Ry-1° F(X). Therefore 
DF(XA~') = DLi4-1 °DR,-1 °DF(X), 

where DR,-, and DL,,-1 are evaluated at F(X) and R,-:(F(X)), respec- 
tively. Then the equality of rank DF(XA~') and rank DF(X) follows as 
above from the fact that DL,,-: and DR,-; are everywhere nonsingular. 
Since O(n) = F~'(I), where I is the identity matrix, the statement follows 
from Theorem 5.8. 


(6.11) Definition Let F: G, > G, be an algebraic homomorphism of Lie 
groups G, and G,. We shall call F a homomorphism (of Lie groups) if F is 
also a C” mapping. 


(6.12) Example Let G, = Gi(n, R) and G, = R* [= GI(1, R)]. Then the 
map F given by F(X) = det X is a homomorphism. 


(6.13) Example Let G, = R, the additive group of real numbers, and 
G, =S', identified with the multiplicative group of complex numbers of 
absolute value 1. Then the mapping F(t) = e?** is a homomorphism. 
Similarly, letting G, = R", a Lie group with componentwise addition, and 
G, = T” =S! x... x S', the mapping F: R” > T" given by F(t,,...,¢,) = 
(exp 2zit,,...,exp 2zit,) is a homomorphism. Its kernel is the discrete 
additive group Z” consisting of all n-tuples of integers; it is called the integral 


lattice of R". 


(6.14) Theorem If F: G, > G, is a homomorphism of Lie groups, then the 
rank of F is constant ; the kernel is a closed regular submanifold and thus a Lie 
group; and dim ker F = dim G, — rank F. 


Proof Let ae G, be arbitrarily chosen and let b = F(a) be its image in 
G,. Denote by e, e, the unit elements of G,, G3 , respectively. Then we may 


write 
F(x) = F(aa~ x) = F(a)F(a~*x) = Lye F(a” x), 
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so that for all xe G; 
DF(x) = DL,- DF(a7 +x). 


Then, since L, is a diffeomorphism, and thus has nonsingular Jacobian 
matrix at each point, the rank of F is the same at x and a~ 1x, hence constant. 
By Theorem 5.8, ker F = F~‘(e,) is a closed regular submanifold whose 
dimension is dim G, — rank F. From Theorem 6.6, ker F is a Lie group since 
it is a regular submanifold (and a group). | 


(6.15) Example A very useful example of a submanifold which is not 
regular but is a subgroup of a Lie group is obtained as follows: Let 
T? =S! x S! and let F:R?>T? be given by F(x',x?)= 
(exp 2nix!, exp 2nix?) as in Example 6.13. Then F is a C” map of rank 2 
everywhere and is a homomorphism of Lie groups; the rank may be easily 
computed at (0, 0) and it is constant by Theorem 6.14. 


Now let « be an irrational number and define G: R > R? by G(t) = 
(t, at). Thus G is obviously an imbedding; its image is the line through the 
origin of slope a. It follows that H = F » G is an immersion of R into T? 
since DH = DF - DG has rank 1 for all te R. Moreover H is one-to-one 
since H(t,) = H(t,) is equivalent to exp 2zit,; = exp 2nit, and exp 2niat, = 
exp 2niat,. However, exp 2riu = exp 2ziv if and only if u — v is an integer. 
Clearly t, —t, and a(t, — t2) are both integers only if t, = t}. Thus 
H: R > T? is a one-to-one immersion and H(R) is an immersed submani- 
fold. However, the interesting fact is that H(R) is a dense subset of T?, so it is 
about as far from being a regular submanifold as one can imagine: for 
example, as a subspace it is not locally connected at any point. 

We shall prove that H(R) is dense in T?. Since F is continuous and onto, 
a dense subset D of R? is mapped to a dense subset of T?. We will show that 
D = F~'(H(R)) is dense. D consists not only of the line of slope « through 
the origin but of all lines which can be obtained from it by translation by an 
integral vector in either direction, that is, any points (x! + m, x? + n), with 
m, n integers and with x! = t, x? = at, must also be in D since F(x’, x”) = 
F(x! + m, x? + n). These lines are all parallel to the given one H(R). In fact 
D consists of the union of all lines t > (t + m, at + n), that is, all lines with 
equation 


x? = ax! + (n — am) 


for arbitrary integers n, m (Fig. III.11). 


Obviously, D is dense on the plane if the y-intercepts (n — am) form a 
dense subset of the y-axis. Thus we must show that given a, any real number 
b, and any £ > 0, there is a pair of integers n, m with |b — (n — am) | < e. 
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x? x?=ax'+(4-3a) 


Figure III.11 


Assume that there exist integers n’, m such that O < n — am’ < £; since 
n — um is irrational, it must then in fact be positive. It follows that for some 
integer k, k(n’ — am’) < b < (k + 1)(n' — am’), which implies 0 < b — 
k(n — am’) < n — am < e. Since n — am = kn’ — akm is a y-intercept ofa 
line of D, such that |b — (n — am)| < e, the following fact from number theory 
suffices to complete the proof: Choose n’, m so that |m’| > 1/e, hence so that 
0<n—am <séor0 <(-n) — alm )< e. 


(6.16) If «> 0 is any irrational number, then there exist arbitrarily large 
integers n', m' such that 


A proof is given by Auslander and MacKenzie [1], or Hilbert and Cohn- 
Vossen [1]. The preceding facts (Example 6.15) are essentially the Kronecker 
approximation theorem; several beautiful proofs are given by Bohr [1]. 

We remark that H: R > T? in addition to being a one-to-one immersion 
is a homomorphism of Lie groups so that R = H(R) is a subgroup alge- 
braically and an immersed submanifold. It is clearly a Lie group with the 
manifold structure of R. However, it is not a regular submanifold nor is ita 
closed subset, although it is the image of a closed regular submanifold under a 
local diffeomorphism F. 
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(6.17) Definition A (Lie) subgroup H of a Lie group G will mean any 
algebraic subgroup which is a submanifold and is a Lie group with its C* 
structure as an (immersed) submanifold. 


We have already discussed subgroups that are regular submanifolds. We 
shall prove the following theorem about such subgroups. 


(6.18) Theorem [fH is a regular submanifold and subgroup of a Lie group 
G, then H is closed as a subset of G. 


Proof Let V, y bea preferred coordinate neighborhood of e, the identity, 
i.e., Y(e) = (0, ..., 0) € C”(0) and g€ H if and only if V ^ H/x"*'(g) = --- = 
x™(g) = 0, n = dim H and m = dim G. If 0 < ô < e, then V’ = y~ +(C3(0), 
V' and V’ o Hare in V. Let {h,} c H be a sequence such that lim,- œ h, = g. 
We must show that ge H, hence H = H. For 0<6<e choose W a 
neighborhood of e sufficiently small so that W7 ŻW = {g7 9219192 €G} € 
V' (cf. Exercise 1). Then there exists N > 0 such that L,- ,(h,) = g7 th, € W 
for n > N, and thus (g~'h,,)~1g~ th, = ha h e W~'W c V’. By continuity 
of the group product, lim hp h, = hm g, but since hp'h eV’ for n, 
m >N, we see that h, 'h„g e V' c V. On the other hand x/(h,,"h,) = 0 for 
j= n+ 1,...,m, and hence x/(h,,'g) = 0 for the same j. Since this point is in 
V, it follows that ha *ge H, which implies ge H. This completes the proof. 
[Example 4.7 shows that this theorem does not hold for submanifolds in 
general. | I 


(6.19) Remark A converse statement is also true: A Lie subgroup H of G 
that is closed as a subset, i.e., H = H, is necessarily a regular submanifold (see 
Lemma IV.9.7). Thus closed Lie subgroups play a special role in Lie group 
theory (cf. also Theorem 6.14)—although Example (6.15) shows that this 
property is not preserved by homomorphisms. It is possible, in fact, to prove 
a much stronger statement: If H is an algebraic subgroup of G and a closed 
subset, then G is a Lie subgroup—and being closed, a regular submanifold. 
For a proof of this important theorem (somewhat beyond the scope of this 
text), see Helgason [1, Chapter II, Section 2], Chevalley [1], or Adams 
[1, 2.27] (after studying Chapter IV!). Nevertheless, it motivates and vali- 
dates terminology which we use hereafter: A subgroup H of a Lie group G, 
which is a regular submanifold, will be called a closed subgroup of G. 


Exercises 


1. Show that given any neighborhood U of e, the identity ofa Lie group G, 
there exists a neighborhood V of e such that VV~' c U, and a neigh- 
borhood W of e such that W? = WW = {g,92|91,92€ W} © U. 

2. Show that the collection {xU}, over all neighborhoods U of e, is a 
base of neighborhoods for x (similarly for {Ux}). 
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3. Let A be an arbitrary subset and U an open subset of a Lie group. Show 
that AU = {au |ae A, ue U} is open. 

4. Prove Theorem 6.4 and also that the projections p,, pz and injections i, j 
of Exercise 3.3, with (a, b) = (e4, e2) the identity of G} x G,, are homo- 
morphisms of Lie groups. 

5. In Lemma 6.7 show that if N is an immersed submanifold and F is 
assumed to be continuous as a mapping into N, then F is C” as a 
mapping into N. 

6. Show that if G is a Lie group, a € G, then the map I,: G > G, defined by 
I,(x) = axa~', is an automorphism of G. 

7. Show that two one-to-one C® immersions F,, F}: N’ > M with the same 
image N are equivalent in the sense of Exercise 4.4 if they induce the same 
topology on N. [Hint: Use Exercise 5.] 

8. Show that if H is an algebraic subgroup ofa Lie group G, then its closure 
A is also an algebraic subgroup. What does Remark 6.19 then imply? 

9, In Example 6.15, assume that « (used in the definition of G) is a rational 
number. Show that H(R) is then a regular submanifold of T? diffeo- 
morphic to S*. (In fact, it will be a knotted S’; see Seifert and Threifall [1, 
Section 52]. 


7 The Action of a Lie Group on a Manifold. 
Transformation Groups 


The definition of a group as a set of objects with a law of composition 
satisfying certain axioms is a relatively recent development. Historically, 
groups arose as collections of permutations or one-to-one transformations 
ofa set X onto itself with composition of mappings as the group product; for 
if X is any set whatsoever, then the collection S(X) of all of its 
“permutations” —in this broad sense—is easily seen to be a group with 
respect to composition of permutations as product. The same is true for any 
subcollection G which contains, together with each transformation 
a: X > X, its inverse o~', and which contains the composition ø ° t of any 
two of its elements ø and z. In particular, if X contains just n elements, then 
§(X) is the symmetric group on n letters and has n! elements, the one-to-one 
transformations of X onto itself. This was, for example, the point of view of 
Galois [1], who considered groups of permutations of the roots of a polyno- 
mial. Much later Klein [1] discovered the central role of groups in all of the 
classical geometries: Euclidean, projective, and hyperbolic (non-Euclidean). 
In this approach, to each geometry is associated a group of transformations 
or permutations of the underlying space of the geometry, and in each case 
the geometry with its theorems may be derived from a knowledge of the 
underlying group. For example, the group G of Euclidean plane geometry is 
the subgroup of S(E?) which leaves distances invariant: If x, ye E? and 
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d(x, y) is their distance, then a transformation T: E? > E? is in the group if 
and only if d(Tx, Ty) = d(x, y) for all x, y. This is called the group of rigid 
motions and it is generated by translations, rotations, and reflections. 

Even though the concept of group does not appear in Euclid’s axioms, 
that of congruence does. These ideas are intimately related: Two figures are 
congruent if and only if there is a rigid motion ø in G which carries one 
figure onto the other. It is a consequence of the properties mentioned above 
for ø, te G that congruence is an equivalence relation. 

Although the interpretation of groups as transformation groups of a 
space has been superseded in algebra, it is still very important in geometry 
and we shall need to discuss it in various aspects. We first define a very slight 
generalization of the group S(X). 


(7.1) Definition Let G be a group and X a set. Then G is said to act on X 
(on the left) if there is a mapping 6: G x X > X satisfying two conditions: 


(i) Ife is the identity element of G, then 
O(e, x) = x for all xe X. 
(ii) Ifg,,g.¢G, then 
(91, (92, x)) = O(gig2,x) forall xe X. 


When Gis a topological group, X is a topological space, and 0 is continuous, 
then the action is called continuous. When G is a Lie group, X is a C” 
manifold, and 0 is a C”, we speak of a C% action. C® action is a fortiori 
continuous. 


As a matter of notation we shall often write gx for O(g, x) so that (ii) 
reads (g1 g2)x = gı(g2 x). We also let 6,(x) denote the mapping 0,: X > X 
defined by 6,(x) = 0(g,x),g fixed, so that (ii) may also be written 


94192 = 99, ° 94, When we define right action, (i) and (ii) become: 


(i) A(x, e) = x and (ii) o(0(x, gı) g2) = O(x, 9192). 


Usually we are concerned with left action, but in both cases we usually say G 
acts on X, and leave the rest to be determined by the context. 

Note that 0,-ı = (0,)"* since 9,-1° 0, = 9,-1, = Oe = ix, the identity 
map on X by (i). This means that each mapping 0, is one-to-one onto, since 
it has an inverse. This and (ii) show that the following statement holds: 


9192 


(7.2) If G acts ona set X, then the map g — 0, is a homomorphism of G into 
S(X). Conversely, any such homomorphism determines an action with 


O(g, x) = (x). 
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We note that the homomorphism is injective if and only if 0, being the 
identity implies that g = e. If this is so, we shall call the action effective. 
When the action is effective, G may be identified with a subgroup of S(X) by 
this map g > 0, so that we have precisely the situation discussed in the 
beginning of the paragraph. Needless to say, these considerations all refer 
only to the set-theoretic aspects, since S(X) has not been topologized. 

We also note that if X is a topological space (C* manifold), G a topologi- 
cal group (Lie group), and the action is continuous (C®), then each 0, is a 
homeomorphism (diffeomorphism). 


(7.3) Example Let H, G be groups, and y: H > G, a homomorphism. It 
is easy to check that 0:H x G —>G defined by 0(h, x) = W(h)x is a left 
action: 

(i) O(e, x) = W(e)x = x, since y takes the identity of H to the identity 
of G, and 

(ii) (hy, O(h, x)) = O(h, Wha)x) = (Its )(W(hg)x) and 


O(h, hy , x) = W(hy h2)x = (w(h,)b(hz))x. 


These agree by the associative law in G. If H and G are Lie groups and y is a 
homomorphism of Lie groups, then the action is C*. This may be applied to 
the case where H is a Lie subgroup of G (or even if H = G); in this case is 
the identity (inclusion) mapping of H into G and we say that H acts on G by 
left translations. 


(7.4) Example A rather simple but important example is known as the 
natural action of Gl(n, R) on R": We let G = Gi(n, R) and X = R” and we 
define 0: G x R" > R” by O(A, x) = Ax, this being multiplication of the 
n x n matrix A by the n x 1 column vector obtained by writing xe R” 
vertically. This satisfies (i) and (ii) rather trivially, (ii) being again associati- 
vity (of matrix products): 

(AB)x = A(Bx). 


Since 0: G x R” — R" is given by polynomials in the entries of 
AeGl(n, R) and xe R”, it is a C®-map: 


x! 


o ((ap{ : |) = (Sau) 


n 


x 


Now suppose that H c Gin, R) is a subgroup in the sense of Lie groups, 
that is, H has its own Lie group structure such that the inclusion map 
i: H > Gl(n, R) is an immersion, or—if H is a closed subgroup—an imbed- 
ding. Then 0 restricted to H defines a C” action 0y: H x R” > R”. This is 
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because 0,, = 0 0 i, i: H > G is the inclusion map, and both 0 and i are C”. 
Using this idea we may give further examples. 


(7.5) Example Let H c GI(2, R) be the subgroup of all matrices of the 
form (4 £) witha > 0. Then H is seen to be a two-dimensional submanifold 
of Gi(2, R) and therefore is a closed subgroup. The restriction to H of the 
natural action of Gi(2, R) on R? is just 


a b\ {x ax! + bx? 
elo op =P ae”) 
which is obviously C”, as expected. 


(7.6) Example Identify E" with R” and let d(x, y) = (7, (x! — yyy”? 
be the usual metric. The group G of all rigid motions, that is, diffeomor- 
phisms T: R” > R” such that d(Tx, Ty) = d(x, y) is given by transforma- 
tions T of the form 

(*) T(x) = Ax + b, 


where A e O(n) and b e R"—a rotation’ A of R” about the origin followed by 
a translation taking the origin to b. The group operation is composition of 
rigid motions. 


The group of rigid motions is a Lie group. It is in one-to-one correspon- 
dence with O(n) x R” and takes its manifold structure from this correspon- 
dence, which is given by assigning to each rigid motion (*) the pair 
(A, b)e O(n) x R". [However, G is not a direct product in the group theor- 
etic sense (Exercise 6).] Since 0: G x R” —> R” defined by 6((A, b), x) = 
Ax + b, 0 is a C* mapping. Verification that 0 defines an action is left to the 
exercises. 


(7.7) Definition Let a group G act ona set M and suppose that A = M is 
a subset. Then GA denotes the set {ga | g € G and a e A}. The orbit of x e M is 
the set Gx. If Gx = x, then x is a fixed point of G; and if Gx = M for some x, 
then G said to be transitive on M. In this case Gx = M for all x. 


(7.8) Example Consider the natural action of Gi(n, R) on M = R". The 
origin 0 is a fixed point of Gl(n, R) and Gi(n, R) is transitive on R” — {0}. For 
if x = (x',..., x") #0, then there is a basis f,,...,f, with x = f,. If we 
express these basis elements in terms of the canonical basis f; = )"-, ajej, 
i= 1,...,n, then we see that x = A-e,, A = (a;;)€ Gl(n, R). From this it 
follows that every x Æ 0 is in the orbit of e,. This action is not very inter- 
esting from the point of view of its orbits. However, if we consider this action 


t Including reflections: A is a proper rotation if det A = +1. 
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restricted to various subgroups G c Gi(n, R), then the orbits can be quite 
complicated. A relatively simple case of this type is obtained by letting 
G = O(n), the subgroup of n x n orthogonal matrices in Gl(n, R). This is a 
closed subgroup as we have seen, and the natural action of Gi(n, R) res- 
tricted to O(n) is a C” action by Example 7.4. The orbits are the concentric 
spheres with the origin being a fixed point (sphere of radius zero). 


(7.9) Remark The same facts from linear algebra that we used above also 
show that Gi(n, R) is transitive on the collection B of all bases of R”. Given 
any basis {f,,..., f,}, then there exists A e Gl(n, R) such that A: e; = f,, in 
fact there is exactly one such A. Thus, letting f= {f,,...,f,} and 
e = {e,,..., e,} be elements of B, we may define a left action of Gi(n, R) on B, 
that is, a mapping 6: Gi(n, R) x B > B by 


O(A, e) = A -e = f = {Ae,..., ACn}. 


This action is transitive as mentioned, moreover the uniqueness of A (such 
that A -e = f) implies that it is simply transitive, that is, given bases f, Ť there 
is exactly one A e Gi(n, R) such that A: f = È This means that Gl(n, R) is in 
one-to-one correspondence with B: A e Gi(n, R) corresponds to A - e, where 
e is the canonical basis. We may use this correspondence to give B the 
topology and C® structure which makes it diffeomorphic to Gi(n, R). As a 
C®* manifold it is called the space of frames of R". 


We have already mentioned quotient spaces of an equivalence relation as 
a possible source of manifolds and, in fact, we have produced two examples 
of such: projective spaces and Grassman manifolds. The most useful and 
important source of such spaces is furnished by the action of groups on 
manifolds; at the moment we can only consider the topological aspects, and 
then only in part, as a preview of things to come. 

As a matter of notation we let G denote a Lie group, M a C” manifold, 
and we assume a C® action 0: G x M > M. We define a relation ~ on M 
by p ~ q if for some g € G we have q = 0,(p) = gp. It is easily seen that ~ is 
an equivalence relation and that the equivalence classes coincide with the 
orbits of G. In fact, p ~ p since p = ep and p ~ q means q = gp, which 
implies p = g ‘gq or q ~ p, so that the relation is reflexive and symmetric. 
Finally, given that p ~ qand q ~ p, we must have q = gp andr = hq so that 
r = (hg)p and then p ~ r. Obviously, p ~ q implies that p and q are on the 
same orbit, so the equivalence class [p] = Gp. Conversely, if q€ Gp, then 
p ~ q so Gp € [p]. 

We denote by M/G the set of equivalence classes; it will always be taken 
with the quotient topology (Definition 2.1) and will often be called the orbit 
space of the action. With this topology the projection n: M > M/G (taking 
each xe M to its orbit) is continuous, and since the action 0 is continuous, z 
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is also open: If U = M is an open set, then so is 0,(U) for every ge G and 
hence GU = [U] = (),<¢ 9,(U), being a union of open sets, is open. The 
orbit space need not be Hausdorff—but if it is, then the orbits must be closed 
subsets of M (each is the inverse image by z of a point of M/G and points are 
closed in a Hausdorff space). We shall be particularly interested in discover- 
ing examples in which M/G is a C” manifold and x a C” mapping. 


(7.10) Example When M = R" and G = O(n) acting naturally as a sub- 
group of Gl(n, R), then the orbits correspond to concentric spheres and thus 
are in one-to-one correspondence with the real numbers r > 0 by the map- 
ping which assigns to each sphere its radius. This is a homeomorphism of 
R"/O(n) and the ray 0 < r < œ; this is not a manifold, but it is almost one. 


(7.11) Example Let G be a Lie group and H a subgroup (in the algebraic 
sense). Then H acts on G on the right by right translations. If H is a Lie 
subgroup, then according to Example 7.3 this is a C* action; the set G/H of 
left cosets coincides with the orbits of this action and is thus a space with the 
quotient topology. We have the following facts concerning G/H (with this 


topology). 


(7.12) Theorem The natural map x: G > G/H, taking each element of G to 
its orbit, that is, to its left coset, is not only continuous but open; G/H is 
Hausdorff if and only if H is closed. 


Proof Since this space—usually called the (left) coset space—coincides 
with the orbit space of H acting on G, z is continuous and open. To prove 
the last statement we use the C“ mapping F:G x G—G defined by 
F(x, y) = y ‘x. Since F is continuous and F~ '(H) is the subset R = {(x, y) | 
x ~ y} of G x G, we see by Lemma 2.4 that R is closed and G/H is Haus- 
dorff if and only if H is a closed subset of G. I 


We conclude this section with two definitions, using terminology which 
we justify in the exercises. 


(7.13) Definition Let G be a group acting on a set X and let xe X. The 
stability or isotropy group of x, denoted by G,, is the subgroup of all ele- 
ments of G leaving x fixed, G, = {ge G | gx = x}. 


(7.14) Definition Let G, X be as in the previous definition. Then G is said 


to act freely on X if gx = x implies g = e, the identity, that is, the identity is 
the only element of G having a fixed point. 
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Exercises 


1. Show that if G acts on X as in Definition 7.13, then for each x, G, is a 
subgroup of G, which in the case of continuous action is a closed subset 
of G. 

2. Suppose that G acts transitively on X. Then given x, ye X, prove that 
G, and G, are conjugate subgroups of G. 

3. Let G act transitively on X and let xp be a point of X. Define F: G > X 
by F(g) = gxo. Prove: (i) that there is a unique one-to-one mapping 
F: G/G,,, > X such that F = F o n, n: G > G/G,,, the natural projec- 
tion to cosets, (ii) that F and F are continuous if the action is contin- 
uous and in this case F is a homeomorphism if and only if F is open. 

4. Show that O(n) acts transitively on S”~ 1, the unit sphere of R”, in a 
natural way and determine the isotropy subgroup. 

5. Show that Gi(n, R) acts transitively on P"~'(R) and determine the 
isotropy subgroup of [(1, 0, ..., 0)]. 

6. Let G=O(n)x V” and define a product in G by (A, v)(B, w) 
= (AB, Bu + w). Prove that G is a Lie group and acts on R” by 
(A, v): x = Ax + v. Show that {I} x V", I being the identity matrix, is 
a closed submanifold and normal subgroup and that O(n) x {0} is a 
closed submanifold and subgroup of G. 

7. Let G be the set of 2 x 2 matrices of the form (4 ?), wherea > Oand b 
are real numbers. Show that G is a Lie group and acts on R by 


ollo m =ax +b. 


8. Let H,a subgroup of G, act on G by left translations. Prove that this is a 
free action. Show that if G acts freely and transitively on the left on X, 
then G and X are in one-to-one correspondence and if they are 
identified, the action is equivalent to left translations. 

9. Let the multiplicative group of nonzero real numbers R* act on 
R5*! = R"*! — {0} by 0: R* x RG*' > R*+, defined by A(t, x) = tx. 
Show that R%,*!/R* is homeomorphic to P"(R). 

10. Let G be a Lie group and H be a closed subgroup and define a left 
action 0: G x G/H > G/H by 0(g, xH) = (gx)H. Show that this action 
of G on the coset space G/H is continuous and that the isotropy group 
of [e] = H is exactly H itself. 

11. As in Example 2.6 identify the k-frames in R", F(k,n), with k x n 
matrices of rank k. Show that Gi(k, R) acts on F(k, n) by left multiplica- 
tion with G(k, n) as orbit space and that the right action of Gi(n, R) on 
F(k,n) induces a transitive action on G(k, n). 
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8 The Action of a Discrete Group on a Manifold 


We will consider in some detail what might seem to be the simplest case 
in which we could hope to use group action to obtain new examples of 
manifolds via the quotient or orbit space concept discussed in the previous 
section. By a discrete group I we shall mean a group with a countable 
number of elements and the discrete topology (every point is an open set). 
The countability means that T falls within our definition of a manifold: it 
has a countable basis of open sets each homeomorphic to a zero- 
dimensional Euclidean space, that is, a point. Thus T is a zero-dimensional 
Lie group. In this case to verify that an action 0: T x M > M is C*, we need 
only show that for each he T the mapping 0,: M > M isa diffeomorphism. 
For convenience of notation, we will let h denote 0, , writing hx for 0,(x), and 
so on. We suppose then that a C® action is given and consider the set of 
orbits M = M/T with the quotient topology discussed before: U = M is 
open if and only if x~1(U) is open in M, where n: M > M denotes the 
natural map taking each x to its orbit x; we have seen that z is then 
continuous and open. 

If M is Hausdorff in this topology, then points are closed sets and the 
inverse image of any pe M, that is, the orbit x” '(p), must be closed. Thus an 
obvious necessary condition for M to possess some kind of reasonable topo- 
logy and manifold structure is that for each xe M the orbit Fx is closed. 
However, this condition is not sufficient. A stronger requirement is the 
following: Given any point x € M and any sequence {h,} of distinct elements of 
T, then {h,,x} does not converge to any point of M. A group action with this 
ipioperty is called discontinuous; it is equivalent to the requirement that each 
orbit be a closed, discrete subset of M. In the presence of other conditions 
this is sometimes enough to ensure that M/T is Hausdorff (see Exercise 2), 
but in general we need the following condition, which is even stronger: 


(8.1) Definition A discrete group I is said to act properly discontinuously 
on a manifold M if the action is C” and satisfies the following two 
conditions: 

(i) Each xe M has a neighborhood U such that the set {her | 
hU ^n U # Ø} is finite; 

(ii) If x, ye M are not in the same orbit, then there are neighborhoods 
U, V of x, y such that Un TV = Ø. 


Observe that (ii) implies at once that M = M/T is Hausdorff: In fact it is 
equivalent to the statement that the subset R = {(x, y)|x ~ y} = M x Mis 
closed (compare Lemma 2.4). 

A consequence of proper discontinuity is the following statement whose 
proof is left as an exercise (Exercise 3)—it could be used to replace (i) in the 
definition, so we denote it by (i). 
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(i) The isotropy group I, of each xeM is finite, and each x has a 
neighborhood U such that hU ^n U = Ø if hg T, and hU = U ifheT,. 


(8.2) Example Let M = S"~', the set {xe R" | ||x|| = 1} and F = Z3, the 
cyclic group of order 2 with generator h, that is, I consists of h and h? =e, 
the identity. Then h(x) = — x and e(x) = x defines an action of T on S""'. It 
is left as an exercise to see that the action 0: Z, x S"-' + S"! is free and 
properly discontinuous and that the quotient space S"~1/Z, is none other 
than real projective n — 1 space P"~*(R). 


This and other examples, such as S! identified with R/Z (see Example 8.7 
below), lead to the following theorem: 


(8.3) Theorem Let T be a discrete group which acts freely and properly 
discontinuously on a manifold M. Then there is a unique C® structure of 
differentiable manifold on M = MJT (with the quotient topology) such that 
each peM has a connected neighborhood U with the property: 
n-'(U) =|] Ū, is a decomposition of n~ ‘(U) into its (open) connected com- 
ponents and n | U, is a diffeomorphism onto U for each component Ŭ,. 


Proof The manifold M is Hausdorff since I acts properly discontin- 
uously. By Lemma 2.3 it has a countable basis of open sets. Using both (i’) 
and the assumption that the action is free, we may find for each xe Ma 
neighborhood U such that hU a U = Ø except when h = e. This implies 
that zg (= z | U) is one-to-one onto its image U, and therefore mj: U > Uis 
a homeomorphism of U to the open set U—the mapping x being both 
continuous and open. There is no loss of generality in supposing U to bea 
connected coordinate neighborhood U, @. Then taking ọ = @° 1%’, we 
have ọ: U > @(U) = R" is a homeomorphism. Since every pe M is the 
image of some xe M, we see that M is locally Euclidean. Thus M is a 
topological manifold. The coordinate neighborhoods U, @ just described 
will be called admissible: If they are C” compatible, then they determine a C® 
_ structure on M for which z is C®, in fact is a local diffeomorphism. Suppose 
that U = x(0) and V = n(V) are overlapping admissible neighborhoods with 
coordinate maps ọ = ong! and y = Ņ} onz’. Although UNV may be 
empty, there must be an h eT such that U nhV# Ø. Since n = r ° h, we may 
write poi=nop !=nchow t; hence pow |= Gong’ onoho! = 
echo. These are all C® maps, in fact diffeomorphisms, and hence 
gow! is C”. The same holds for ye p~', which proves that the admissible 
neighborhoods U, ¢ define a C” structure. The coordinates of p € U and 
F(p)eU being exactly the same, r| U is a diffeomorphism, and hence 
n= noh is a diffeomorphism of hU onto U for any heT. Since {z7 *(U)} 
covers M, nz: M > M is a C” map and a local diffeomorphism. Because of the 
requirement that (0) be a diffeomorphism, no other C® structure is 
possible. i 
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We remark that z is C” of rank n = dim M = dim M since it is locally a 
diffeomorphism. Of course, it can be one-to-one only when I = fe} for 
z~ !(p) = Tx for some x, and this orbit x is in one-to-one correspondence 
with I itself by virtue of the assumption that I acts freely. 

We shall prove a lemma and then a theorem which will supply some 
examples of free, properly discontinuous action. 


(8.4) Lemma Let G be a Lie group and T an algebraic subgroup. Then there 
is a neighborhood U of the identity e such that T ^ U = {e} if and only if T is a 
discrete subspace, in which case T = I, i.e., T is closed and countable. 


Proof Clearly, if I is a discrete subspace, then every heľ has a 
neighborhood V in G such that V VT = {h}. Conversely, if e e I has such a 
neighborhood, say U, then for any he T, V = AU contains only the element h 
of T. It is even possible to choose then a neighborhood W of e such that 
WW-!cU, in which case hW ^ h, W = Ø unless h, = hp. For if hw; = 
h,w, is the intersection, then hy'h, =w,w,;'e¢WcU which implies 
hz 'h, =e or h, = h. Then I must be countable since {hW|heT} is a 
nonintersecting family of open sets in one-to-one correspondence with I. Any 
such family is countable since G satisfies the second axiom of countability. 
Finally, if is not a closed subset, it is easy to see that there must be a 
sequence {h,} c T with limp» h, = e. This contradicts the existence of U 
with U nT = {e}. I 


We remark that a T with this property is a closed zero-dimensional Lie 
subgroup of G in the sense of Definition 6.17; such subgroups are often 
called simply discrete subgroups. We give examples below. 


(8.5) Theorem Any discrete subgroup I of a Lie group G acts freely and 
properly discontinuously on G by left translations. 


Proof No other translation than the identity has a fixed point so the 
action is free. To see that it is properly discontinuous we must check (i) and 
(ii) of Definition 8.1. Choosing U, V neighborhoods of e as in the proof of 
the preceding lemma so that VVT! c U and U AT = fe}, we see that the 
only he T such that hV n V + Ø ish = e. This proves (i). To prove (ii) we 
argue as follows. If Px and Ty are distinct orbits, then x ¢ Ty, and since T y is 
closed, by the regularity of G there is a neighborhood U of x such that 
U aly = Ø. Let V bea neighborhood of e such that xVV~? c U. If the 
open sets [xV and I'yV intersect, then some element of xVV~* must be in 
Iy, which is an immediate contradiction. This completes the proof. | 


(8.6) Corollary IfT is a discrete subgroup of a Lie group G, then the space 
of right (or left) cosets G/T is a C® manifold and n: G > G/T is a C” mapping. 
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This is a combination of Theorems 8.3 and 8.5. It may also be considered 
as a generalization of Theorem 7.12 since I is a closed subgroup of G. 


(8.7) Example A particularly important example is the following. Let 
G = V’, that is, R” considered as a vector space, and let l = Z”, the n-tuples 
of integers—usually called the integral lattice. [More generally one could 
take for I’ the integral linear combinations of any basis f,,..., f, of V".] isa 
discrete subgroup; the neighborhood C7(0) of the origin withe < 1 does not 
contain any element of I’ other than (0, ..., 0). The reader should verify that 
v"/T = V"/Z" is diffeomorphic to T" = St x+- x S’, the n-dimensional 
torus, and that z is a Lie group homomorphism of V” onto T” with F as 
kernel. 


(8.8) Example Any finite subgroup F of a Lie group G is a discrete 
subgroup. When C is compact, a discrete subgroup must be finite; but even 
in this case there are interesting examples. Thus in the case of SO(3), the 
group of 3 x 3 orthogonal matrices of determinant +1, the subgroups of 
symmetries of the five regular solids give examples among which is the 
famous icosahedral group, which contains 60 elements. (See Wolf [1, 
Section 2.6].) 


(8.9) Example In the case of groups which are not compact we have 
many variations of the following theme: Let G ) = Gi(n,R) and 
To = Si(n, Z), the n x n matrices with integer coefficients and determinant 
+1. Since the topology of Go is obtained by considering it as an open subset 
of R”, it is clear that Ty corresponds to the intersection of Go with the 
integral lattice Z” and hence is discrete. Having said this, suppose G to be a 
Lie subgroup of Gy and let F = Fo A G. Then F is discrete in G. An illustra- 
tion is the following: Let G be all matrices in Gl(n, R) with +1 on the main 
diagonal and zero below and let F be its intersection with Si(n, Z). 


An interesting question about which one can speculate is the following: 
In which, if any, of these cases is G/T compact? Note that it is compact when 
G = V" and T = Z". A necessary and sufficient condition for compactness is 
the existence of a compact subset K c G whose IT -orbit covers G, FK = G. 
In Example 8.7, any cube K of side one or greater has this property. 

We terminate by mentioning some examples related to “ tiling ” the plane 
and to crystallography. Note that reflection in a line is a rigid motion of the 
plane, and in fact any rigid motion is a product of reflections, which thus 
generate the group of motions of the plane. For example, the group F 
generated by reflections in the four lines x = 0, x = 3, y = 0, y = ł relative 
to a fixed Cartesian coordinate system contains the group of translations 
(x, y) > (x + m,y +n), m,n integers. This latter group may be identified 
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d f 


Figure III.12 


with the subgroup Z? of V? discussed above. Moreover the action of T 
leaves unchanged the figure consisting of lines x = k/2, y = 1/2, k, l integers, 
that is, a collection of squares which “tile” the plane. 

Similarly, if we tile the plane with other polygons as in Fig. III.12, we see 
that the group T of reflections in all lines forming edges of these polygons 
leaves the whole configuration or tiling unchanged. The reader can verify 
geometrically that the group T in these illustrations acts properly discontin- 
uously. Is the action free? This is an important method of obtaining such 
group actions. It has an immediate extension to E>, where it is intimately 
related to crystal structures. (See, e.g., Grove and Benson [1].) 


Exercises 


1. Let I be an abstract group acting as diffeomorphisms on a C”? manifold 
M and assume that I, is finite for every pe M. If the orbits are closed, 
discrete subsets of M, then prove that T is necessarily countable and the 
action discontinuous. 

2. Suppose that a C” manifold M is a metric space and that T is a discrete 
group of C® isometries acting discontinuously on M. Show that the 
action is necessarily properly discontinuous. 

3. (a) Show that (i) may be replaced by (i’) in Definition 8.1; and 
(b) show that (ii) may be replaced by (ii): M/T is Hausdorff. 

4. Show that the only closed nontrivial subgroups of R are of the form 
T = {na|a #0, ne Z}. Determine all closed subgroups of R”. 

5. Verify the statements of Example 8.2. 

6. Let G consist of all 3 x 3 matrices which have + 1 along the diagonal 
and zero below and F the matrices in G with integer entries. Show that I 
is a closed discrete subgroup and G/T is a compact Hausdorff space. 

7. Prove that a discrete normal subgroup of a connected Lie group G is in 
the center of G. 
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9 Covering Manifolds 

Some of the examples of the previous section are intimately related to the 
notion of covering manifolds. Let M and M be two C” manifolds of the 
same dimension and n: M > M a C” mapping. Using this notation, we 
make the following definition: 


(9.1) Definition M is said to be a covering (manifold) of M with covering 
mapping x if z is onto, M is connected and if each pe M has a connected 
neighborhood U such that 2~'(U) = |] U,, a union of open components 
U,, with the property that zy, the restriction of x to U,, is a diffeomorphism 
onto U. The U are called admissible neighborhoods and 7 is called the 
projection or covering mapping. 


Examples abound in the previous section: M = R covers M = S! 
realized as complex numbers of absolute value +1 with x(t) = exp 2zit. 
(This may be visualized as in Fig. III.13 with z the projection to the circle S+.) 
More generally M = R" covers T”. Example 8.2 shows that S"~? covers 
P"~'(R) and in a very general way Theorem 8.3 tells us that if T acts freely 
and properly discontinuously on M, then M covers M = MJT. Here the 
map z is the obvious one: it takes each x € M to its orbit Px which is a point 
of M. 

It would seem at first glance that the examples of covering manifolds 
must be much more extensive than those furnished via Theorem 8.3. We 
shall see much later that this is not the case. At present we can only begin to 
show how this is demonstrated. Let us assume then that z: M —> M is any 
covering of a manifold M by a connected manifold M. We indicate how this 
may give rise to a group Č acting freely and properly discontinuously on M. 


(9.2) Definition A diffeomorphism h: M > M is said to be a covering 
transformation, or deck transformation, if n o h = n. 


Note that this is equivalent to the requirement that each set z~ '(p) is 
carried into itself. In case the covering is one arising from free, properly 
discontinuous action of a group T on M, then each her is a covering 
transformation of the covering z: M > MJT. More generally, the set f° of all 
covering transformations acts as a group of diffeomorphisms on M. It 
contains at least the identity so it is not empty. 

Given any x € M and p = n(x), let U be an admissible neighborhood of p 
so x 1(U)=(\) U,, where a = 1, 2,... (the collection of mutually disjoint 
neighborhoods {U,} must be countable). Since the U, are connected com- 
ponents of n~ +(U), either h(U,) 0 Ux = ¢, or h(U,) = Uy and h: 0, > U,, is 
a diffeomorphism—with h| Ū, = nū. ° mg,. We can conclude that the points 
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Figure I1.13 


left fixed by h form an open set. By continuity of h they also form a closed set, 
and— M being connected —this set is empty or h is the identity. In particular, 
two covering transformations with the same value on a point x must be 
identical. Thus covering transformations are completely determined by the 
permutation x— a’ they induce on the set of points {x,} =" ~*(@) for an 
arbitra (but fixed) point p e M. In particular, the action of Ť on M is free. If 
XEN 1(p), then h > hx, maps I into n` !(p). This mapping is an injection so 
f must be countable, and as a discrete group of diffeomorphisms of M, it acts 
differentiably on M. This proves, in part, the following theorem: 


(9.3) Theorem With the notation above, I acts freely and properly discon- 
tinuously on M and the natural map 7: M = MIT is a covering mapping. IfT is 
a transitive on n~'(p) for every p e M, then there is a natural identification of M 
with M/T and of n, with n. 
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Proof We have already seen that I acts on M freely since only the 
identity has a fixed point. We must check (using admissible neighborhoods) 
that the action is properly discontinuous. If xe M and p = x(x), then 
xe{x,}=2°'(p), say x= xı, and if h#e, then h(x,) =x, # x, so 
h(O,) = U, with O, o 0, = Ø. Thus part (i) of proper discontinuity is 
proved. 

For part (ii) we take x, ye M not in the same orbit of f and con- 
sider two cases: either x(x) = n(y) or not. If they are the same, denote 
this point by p, as above, and note that in permuting {x,} = 77 !(p), no h € Ÿ 
takes x = x, to y = xş (a + p); whence U, is not carried to U, by any he. 
This establishes (ii) of Definition 8.1 in this case. When, on the other hand, 
n(x) = p and z(y) = q are distinct, it is even easier. Let U, V be disjoint 
admissible neighborhoods of p, q, respectively. Then the open sets 2” '(U) 
and x '(V) are disjoint and carried into themselves by every h € Ï, so they 
answer to requirement (ii). Thus the action is properly discontinuous and the 
first part of the theorem follows from Theorem 8.3. 

To prove the second part of the theorem we define 2,:M/T > M by 
m([x]) = m(x), where xe M and [x] is its T-orbit. Since n(x) = n(h(x)) for 
any heŬ, z, is well-defined and x = 2,07,. We leave it as an exercise to 
verify that z,:M/T — M is a covering. If f is transitive on 2~'(p) for each 
pe M, then x~‘(p) is a -orbit and z3 '(p) is a single point of M/T. In this 
case the latter part of the theorem is a consequence of the following easy 
lemma applied to z,. I 


(9.4) Lemma Let n: M —>M be a covering and suppose that for some 
pe M, x~‘(p) is a single point. Then n is a diffeomorphism. 


Exercises 


1. Prove Lemma 9.4 by using the connectedness of M. 

2. Show that z,: M/T —> M as defined above is a covering. 

Show that the covering transformations form a group and that if 

x, ye M, a covering manifold of M, then there is at most one covering 

transformation taking x to y. Show further that if f is transitive on 

z` (p) for some pe M, then it is transitive for every p. 

4. Let z: M >M be a covering and X a connected space F: X >M a 
continuous mapping. Suppose F,, F,: X > M are continuous, have the 
property that mo F;=F, and agree on one point of X. Show that 
F =F). 

5. Let x: M —> M bea covering and F: [a, b] > M a continuous curve from 
F(a) = p to F(b) = q. If xo € a ' (p), show that there is a unique contin- 
uous curve F: [a, b] + M such that F(a) = xo and z o F = F. 
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6. Let I = [0, 1] be the closed unit interval and I" = I x -- x I, the n-fold 
Cartesian product. Suppose F:I"—M is continuous with 
p = F(O,..., 0). If z: M > M is a covering and xen“ (p), then prove 
that there is a unique continuous map F: I" > M such that F = n° F 
and F(0, v0 0) = x. 

7. Let Ñ, M be C” manifolds of dimension n and n: M > M a C” map 
which is onto and has rank n at each point. Prove or disprove the 
statements: (a) z is locally a diffeomorphism; (b) x is a covering map. 


Notes 


The concept of differentiable manifold which was presented in this chapter is the result of 
many influences and the work of many great mathematicians beginning with Gauss and Riem- 
ann. In its present form it is of fairly recent creation. The early work in differential geometry was 
of a local character—hence open subsets of Euclidean space were adequate models. Even the 
space of non-Euclidean geometry, as a manifold, is equivalent to Euclidean space—it is only the 
metric aspects of the geometry which are different. The same is true of Lie groups, as studied by 
Lie, since only group germs or local Lie groups, that is, neighborhoods of the identity, were 
considered. Except for Riemann surfaces and projective spaces, there was little to force the 
global aspects of manifolds into prominence. Nevertheless it seems clear that Riemann, and 
certainly Poincaré, conceived of manifolds in essentially the same way that we do today although 
without the present precise separation between the underlying topological space and the 
differentiable structure. In fact the concept was so difficult that even many great mathematicians 
—well into this century —were unable to grasp the concept of a topological or differentiable 
manifold. This depended on the idea of overlapping coordinate neighborhoods, perhaps first 
clearly expressed in the book on Riemann surfaces by Weyl [1] published in 1923 and brought to 
its present form in the seminal paper of H. Whitney [1] published in 1936. This paper was 
particularly important in that it showed that there was no essential distinction between an 
abstract manifold (Def. 1.2) and a regular submanifold of Euclidean space (cf. Theorem V.4.7). 

Poincaré, whose imprint is everywhere in this subject, studied manifolds from many points 
of view: as phase spaces of dynamical systems, as Riemann surfaces (in which covering spaces 
and discontinuous groups played an important role), and from the aspect of algebraic topology. 
(For discussion of these topics the reader can consult the survey article by Smale [1] and the 
books of Siegel [1] or Lehner [1].) All of the work of Cartan on Lie groups and differential 
geometry (see Chern and Chevalley [1]) has had an enormous influence on the subject as has the 
enormous development of algebraic topology, beginning especially with the work of L. E. J. 
Brouwer and H. Hopf. The latter was particularly interested in the interplay between topological 
invariants and differential geometric properties, beginning with his paper (Hopf [1]) in 1925. 

The work of the past 50 to 60 years is far too extensive to summarize here, however one 
recent development is of special relevance to this chapter. One basic question left over from the 
foundations of the subject was a better understanding of the relation between the underlying 
topological structure of a manifold and its differentiable structure. It was long believed that, up 
to diffeomorphism, only one differentiable structure could be defined on a topological manifold. 
In a famous paper in 1956 Milnor [4] showed that this was not in fact the case and that S7 was a 
counterexample. Soon after Kervaire and Milnor [1] showed that many spheres fail to have a 
unique C® structure (see their tabulation below). Then in 1961 Kervaire found an example of 
topological manifold of dimension 10 with no differentiable structure at all! Finally, very recently 
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it has been shown that every Euclidean manifold R”, with the exception of R*, has a unique 
differentiable structure but that the exception R* has many (see Freedman [1] and Gompf [1]). 

A very readable introduction to some of these questions may be found in Milnor [5], a 
book which also contains a discussion of the Poincaré conjecture by R. H. Bing. 


THE DIMENSION n AND NUMBER m OF NONDIFFEOMORPHIC 
STRUCTURES ON S” 


n| 2 3 45 6 7 8 9 10 lf 12 13 14 15 16 


mi|1 1? 1 21 1 28 2 8 6 992 1 3 2 16256 2 
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IV VECTOR FIELDS ON A MANIFOLD 


In this chapter we introduce some of the most basic tools used in the study of differentiable 
manifolds. First we define the tangent space T,(M) attached to each pe M, M a C” manifold. 
Each element X, of T,(M) can be considered as an operator (directional derivative) on 
C*®-functions at p, generalizing one of the definitions in the case of R”. We also see that a C® 
mapping F: M > N induces a linear map F,: T,(M) > Trip (N) on the tangent space at each 
point. Associated to a coordinate system around p will be a basis of T,(M). If F is the expression 
for F in local coordinates around p and F(p), then the Jacobian DF is the matrix of F, with 
respect to these bases. 

Assigning a vector X „to each pe M we obtain a vector field on M, just as in the special case 
R”. Vector fields are intimately associated with the action of the Lie group R on M, that is, 
one-parameter transformation groups. The relation between them is a consequence of—and in 
some sense equivalent to—the fundamental existence theorem for solutions of systems of ordin- 
ary differential equations. Section 2 gives the basic definitions of vector fields, Section 3 the 
basic definitions of one-parameter transformation groups acting on a manifold and Section 4 
the existence theorem. Systems of ordinary differential equations are shown to coincide with 
vector fields on manifolds and their solutions with curves on the manifold which are tangent to 
the vectors of the field. These curves are also orbits of the group action. In Section 5 a number 
of examples are given, using Lie group action as a starting point. 

In Section 6 a study is made of one-parameter (one-dimensional) subgroups of a Lie group 
G. These are basic in the study of Lie groups, but we use them primarily as a source of examples 
and to illustrate the basic ideas above. They are in one-to-one correspondence with the vectors 
X,€ T,{G) the tangent space to G at the identity element and, in the case of matrix groups, are 
easily obtainable in terms of the “exponential” of a matrix. 
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Section 7 is concerned with the set ¥(M) of all C*-vector fields on M. It is a vector space 
over R and a module over the C® functions on M. Moreover it has a naturally defined product, 
the bracket [X, Y}. Using all these ideas one is able to define the Lie derivative Ly Y of the 
vector field Y in the direction of the field X ; this gives a new vector field dependent on X and Y. 
This derivative results from the group action associated with X, which enables us to compute 
the change in Y as we move along the orbit. 

The final two sections give Frobenius’ theorem, a very basic existence theorem in manifold 
theory, and some applications. These two sections can be omitted on a first reading; they are 
important but we make relatively few applications of them. For a first reading, in fact, 
Sections 1-4 are the most crucial. 


1 The Tangent Space at a Point of a Manifold 


Let M denote a C” manifold of dimension n. We have defined for M the 
concepts of C” function on an open subset U and of C” mapping to another 
manifold. This allows us to consider C*(U), the collection of all C” func- 
tions on the open subset U (including the special case U = M), and to 
verify—as we did for U c R"—that it is a commutative algebra over the real 
numbers R. As before, R may be identified in a natural way with the constant 
functions and the constant 1 with the unit. Given any point pe M we 
may—as for R"—define C®(p) as the algebra of C” functions whose domain 
of definition includes some open neighborhood of p, with functions 
identified if they agree on any neighborhood of p. The objects so obtained 
are called “germs” of C® functions (Exercise 1). Choosing an arbitrary 
coordinate neighborhood U, ọ of p it is easily verified that p*: C°(@(p)) > 
C”(p) given by y*(f) = fe g is an isomorphism of the algebra of “ germs” 
of C” functions at ọ(p)e R" onto the algebra C™(p). This is to be expected 
since locally M is C®-equivalent to R" by the diffeomorphism g. Our main 
purpose is to attach to each pe M a tangent vector space T,(M), as was done 
for R" and E". [See Fig. IV.1 for the geometric idea of T,(M).] Although our 
first definitions in the latter case giving T,(R") as directed line segments do 
not generalize, the identification (based on Theorem II.4.1) of T,(R") with 
directional derivatives does. 


(1.1) Definition We define the tangent space T,(M) to M at p to be the set 
of all mappings X,: C*(p) > R satisfying for alla, Be R and f, g € C” (p) the 
two conditions 


(i) X,(of + Bg) = o(X,f) + B(X,g) (linearity), 
(ii) X,(f9) = (X,f)9(P) + f(p(Xpg) (Leibniz rule), 
with the vector space operations in T,(M) defined by 
(X,+ Y) =X f+ YS 
CXS = X, f). 
A tangent vector to M at p is any X „e T,(M). 
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Figure 1V.1 


One should check that this does in fact define a vector space T,(M) at 
each point p of M. Although we are now dealing with C” functions on a 
manifold, formally the proofs are the same as in Section II.4 where it was 
established that 2(a), the mappings of C®(a) to R having properties (i) and 
(ii), was a vector space. 

We remark that the definition of T,(M) uses only C”(p), not all of M; 
thus if U is any open set of M containing p, then 7,(U) and T,(M) are 
naturally identified. Of course, our proof that T,(M) is a vector space in- 
cludes the earlier case of R”, the difference is that we no longer have the 
alternative “geometric” way of defining T,(M) as pairs of points px as we 
did in R”, because that method used special features of R", namely the 
existence of a natural one-to-one correspondence with the vector space V”. 
For manifolds in general, any such correspondence entails a choice of a 
coordinate neighborhood and depends on the particular neighborhood 
selected; so it is not natural in the sense we have used the term. However, for 
each choice of coordinate neighborhood U, ọ containing p€ M we obtain an 
isomorphism to V” as we shall see. It is by this method that we can establish 
that dim T,(M) = dim M. 


(1.2) Theorem Let F: M > N be a C” map of manifolds. Then for pe M 
the map F*: C*(F(p)) > C®(p) defined by F*(f) = f° F is a homomorphism 
of algebras and induces a dual vector space homomorphism F,,: T,(M) > 
Trip(N), defined by F,(X,)f = X,(F*f), which gives F(X ,) as a map of 
C*(F(p)) to R. When F: M —> M is the identity, both F* and F, are the 
identity isomorphism. If H = Go F is a composition of C” maps, then 
H* = F* o G* and H, = G,° F}. 
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Proof The proof consists of routinely checking the statements against 
definitions. We omit the verification that F* is a homomorphism and con- 
sider F, only. Let X,€ T,(M) and f, ge C*(F(p)); we must prove that the 
map F,(X,): C*(F(p)) > R is a vector at F(p), that is, a linear map satisfy- 
ing the Leibniz rule. We have 


F(X (Ja) = X F*( fa) = X (f° F)g ° F)] 
= X (f° F)g(F(p)) + S(F(p))X,(f° G), 
and so we obtain 


F (Xa) = (F(X) )o(F(p)) + S(F@))F (Xp) 


(linearity is even simpler). Thus F,: T,(M) > Trp (N). Further, F, is a 
homomorphism 


F (aX, + BY,)f = (aX, + BYS °F) = 4X (f ° F) + BYS ° F) 
= aF (X ,)f + BF (Y,)f 


(1.3) Remark The homomorphism F,: T,(M) > Trp (M) is often called 
the differential of F. One frequently sees other notations for F, , for example, 
dF, DF, F', and so on. The x is a subscript since the mapping is in the same 
“direction” as F, that is, from M to N, whereas F*: C”(F(p)) > C®(p) goes 
opposite to the direction of F. This notational convention can be quite 
important and reflects a similar situation in linear algebra related to linear 
mappings of vector spaces and their duals. 


Although, once definitions are correctly made and rather mechanically 
applied, the statements above have trivial proofs, nonetheless they are most 
important and useful, even if M and N are Euclidean spaces. We shall 
consider some of the consequences now. 


(1.4) Corollary If F: M >N is a diffeomorphism of M onto an open set 
U c N and pe M, then F: T,(M) > Tpg(N) is an isomorphism onto. 


This follows at once from the last statement of the theorem and the 
remark after Definition 1.1 if we suppose G is inverse to F. Then both 
G, ° F,: T,(M) > T,(M) and F, ° Gy: Trg(N) > Trgp(N) are the identity 
isomorphism on the corresponding vector space. 

Remembering that any open subset of a manifold is a (sub)manifold of 
the same dimension, we see that if U, ọ is a coordinate neighborhood on M, 
then the coordinate map g induces an isomorphism 9, : T,(M) > Typ (R") 
of the tangent space at each point pe U onto T,(R"), a = o(p). The map g~}, 
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on the other hand, maps T,(R") isomorphically onto T,(M). The images 
E; = 9x (ĉ/2x’), i = 1,..., n, of the natural basis ĉ/ĉx!, ..., ĉ/ĉx" at each 
ae g(U) = R” determine at p = 9 ‘(a)e M a basis E,,...., Enp of T,(M); 
we call these bases the coordinate frames. 


(1.5) Corollary To each coordinate neighborhood U on M there corre- 
sponds a natural basis E,,,.-- Enp Of T,(M) for every pe U; in particular, 
dim T,(M) = dim M. Let f be a C™ function defined in a neighborhood of p, 
and f = fe! its expression in local coordinates relative to U, y. Then 
E; = (Cf/ex')oip- In particular, if x'(q) is the ith coordinate function, X , x‘ is 
the ith component of X „ in this basis, that is. X, = Py (X, x’)Ep 


The last statement of the corollary is a restatement of the definition in 
Theorem 1.2 for E;, = 9, '(0/ĉx'), namely, 
PE 


Epf= (lá) Z feg’ 


If we take f to be the ith coordinate function, f(q) = x'(q) and 
X, = } wE,,, then 


i i j ex’ i 
X,x = Z Ep) = Z (5) = gi. 


j o(p) 


x=ọ(p) 


We may use this to derive a standard formula which gives the matrix of 
the linear map F, relative to local coordinate systems. Let F:M > N bea 
smooth map, and let U, ọ and V, w be coordinate neighborhoods on M and 
N with F(U) c V. Suppose that in these local coordinates F is given by 


yi = fi(x},..., xX")  i=1,...,m, 


and that p is a point with coordinates a = (a’,...,a"). Then F(p) has y 
coordinates determined by these functions. Further let éyi/éx' denote 
of t/dx'. 


(1.6) Theorem Let E,, = p} '(3/0x') and jr = Wy (0/0), 1 = 1s... 
and j = 1,..., m, be the basis of T,(M) and Trp(N), respectively, determined 
by the given coordinates neighborhoods. Then 


m 6 = ; 
F(E) = È (ax) Bip i232 ered 
j=l a 


In terms of components, if X , = >, «E; maps to F(X p) = © BE jmp) then we 
have 
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The partial derivatives in these formulas are evaluated at the coordinates of 
p: a= (a',...,a") = o(p). 

Proof We have F,(Ej,) = F,° 9; '(0/0x'),p, and according to 
Corollary 1.5, to compute its components relative to Erg, we must apply 
this vector as an operator on C%(F(p)) to the coordinate functions y/ 


êa, ð, E afi 
Filey = (Fx ° Oy (r= zarf =o 1)(x) = axl? 


these derivatives being evaluated at the coordinates of p, that is, at ~(p); they 
could also be written (dy//dx'),,,). 


We now obtain two corollaries to Theorem 1.6, in the first, F, M, and N 
are as in Theorem 1.6. 


(1.7) Corollary The rank of F at p is exactly the dimension of the image of 
F,,(T,(M)). Fẹ is an isomorphism into if and only if this rank is the dimension 
of M; it is onto if and only if the rank equals dim N. 


Proof We obtain this immediately from linear algebra since (0y'/0x’) is 
exactly the Jacobian of yo Fg” ', which we used to define the rank in 
Definition IJI.4.1, and is also the matrix of the linear transformation 
F,,: T,(M) > Trp N in the given bases. a 


This corollary gives a characterization of the rank which is independent 
of any coordinate systems, a situation toward which we constantly strive in 
studying properties of various objects on manifolds. 

If we apply the theorem to the maps F = @° g ‘and F°'=gc°@"' 
which give the change of coordinates from U, g to U, @ in U ^ U on M, 
then we obtain formulas for change of basis in T,(M) and the corresponding 
change of components relative to these bases. 


(1.8) Corollary Let peUnU and let Ey = py" (6/ex') and 
Ein = Q, '(6/0X') be the bases of T,(M) corresponding to the two coordinate 
systems. Then with indices running from 1 to n, we have 


ox* m x ox! 
=i) £ d È=), (Š) E 
= da (Sho s j = z EN i 
If X, = } dEi = } BiE;,, then 
, 6x! : Ox 
t= š J — = F d Pace 
= 558 agi and f=Yi0 Axi 
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The proof is left as an exercise. The second set of formulas is often used to 
define tangent vector at a point p of a manifold: a tangent vector X, is an 
equivalence class of the collection of all n-tuples {(a',...,0")u, pl E R, U,@ 
a coordinate neighborhood of p}; two such n-tuples (a, ..., &")u, ọ and 
(BŁ, ..., B")g.g being equivalent if they are related as in last formula of 
Corollary 1.8 (see Exercise 4). 

We may apply Corollary 1.7 to the following situation: M is a submani- 
fold of N with F: M > N the immersion or inclusion map of M into N. In 
either case, the mapping F from M (with its C” manifold structure) into N 
(with its C” structure) isa C” mapping, and rank F = dim M. This means 
that F,: T,(M)— T,(N) is an injective isomorphism so that T,(M) can be 
identified with a subspace of T,(N). This identification being made we can 
think of 7,(M), the tangent space to M, asa subspace in T,(N) for each 
peM. Applying this principle to our examples of submanifolds of R”, 
especially when n = 2 or 3, will enable us to recapture some of the intuitive 
meaning of tangent vector which was lost in the transition from Euclidean 
space to general manifolds. Of course this applies only to those manifolds 
which can be realized as, that is, are diffeomorphic to, a submanifold of R”. 


(1.9) Example Consider the case of a C® curve F: M > N ina manifold, 
where M = (a,b) is an open interval of R; for the moment we drop the 
requirement that F is an immersion. Given tọ€ M, a < to < b, then d/dt 
taken at tọ is a basis for T,,(M). Suppose p = F(to) and fe C”(p), then 
F ,(d/dt) is determined by its value on all such f: 


ràg- (i TOR 


We shall call this vector the (tangent) velocity vector to the curve at p. 
(Fig.IV.2). In this interpretation we use the parameter te R as time, and we 
think of F(t) as a point moving in N. 

In particular, if U, g are coordinates around p, then in the local coordin- 
ates F is given by F(t) = ọ ° F(t) = (x'(t), --.. x"(t)). The ith coordinate x! is 
a function on U and using somewhat sloppy notation, we write x(t) = 
(x! o F)(t); thus F,(d/dt)x' = (dx‘/dt),,, which we denote Kt = 1,..., lt 
So by Theorem 1.6 (with E,, = d/dt and E’s replacing E’s), 


F.(3) = S(t) Ei 


i=1 


Now as a special case let N = R”. With the usual (canonical) coordinates 
of R" this formula means that the image of d/dt is just the velocity vector at 
the point p = (x'(to), ---, X"(to)) of the curve. Its components relative to the 
natural basis at the point p are x! (to), ..., X"(to); it is the vector of T,(R") 
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Figure IV.2 


whose initial point is p = x(tọ) and whose terminal point is (x'(to) + x'(to), 
w+) X"(to) + X"(to)). If the rank of F at to is 1, then F, is an isomorphism and 
we may identify the tangent space to the image curve at p with the subspace 
of T,(R") spanned by this vector, thus obtaining the usual tangent line at the 
point p of the curve. If the rank of F at tọ is 0, then F,(d/dt) = 0. 


(1.10) Example We now suppose M to be a two-dimensional submani- 
fold of R?, that is, a surface. Let W be an open subset, say a rectangle in the 
(u, v)-plane R? and 0: W > R? a parametrization of a portion of M 
(Fig.IV.3). Namely, suppose 0 is an imbedding whose image is an open 
subset V of M; V, 07} is a coordinate neighborhood on M. Suppose 
O(uo, vo) = (Xo, Yo: Zo), Where we now use (x, y, z) as the natural coordin- 
ates in R?. We may assume that @ is given by coordinate functions 


x=f(u, v), y = g(u, v), z = h(u, v). 


Since @ is an imbedding, the Jacobian matrix 0(f, g, h)/é(u, v) has rank 2 at 
each point of W. We consider the image of the basis vectors d/éu and 0/év at 
(uo, Vo). We denote these by (X„)o and (X,)o. According to the first formula 
of Theorem 1.6, they are given by 


6 0x Gd éyG zô 
= 0 = 
(Xu)o (5) Oudx Oudy dudz’ 


0 ôx ĉyð zô 
(X.J = AE ~ ĝvôx ôvôõy  ôvôz’ 
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u 


Figure IV.3 


where we have written @x/éu, 0x/év for ôf/ðu, 6f/dv, and so on, these deriva- 
tives being evaluated at uo, vo. Since 0, has rank 2, these are linearly 
independent vectors, and they span a two-dimensional subspace of 
Tixo. yo. zo(R°). This subspace is what we have, by our identification, agreed 
to call the tangent space of M at the point (xo, Yo, Zo); it consists of all the 
vectors of the form a6,(0/éu) + B0,(G/dv) = a(Xy)o + B(X,)o, %& BER; 
their initial point, of course, is always at (Xo, Yo, Zo). It is easily seen that this 
subspace is the usual tangent plane to a surface, as we would naturally 
expect it to be. We use one of the standard descriptions of the tangent plane 
at a point p of a surface M in R°: the collection of all tangent vectors at p to 
curves through p which lie on M. In fact let J be an open interval about 
t = to and let us consider a curve on N through (xo, Yo: Zo). It is no loss of 
generality to suppose the curve given by F: I > W composed with 0: W > R?; 
thus u, v, are functions of t with u(t) = uo and v(to) = to and the curve is 
given by 


O(F (t)) = (x(u(t), v(t)), y(u(t), v(t), z(u(t), v(t))). 
The tangent to the curve at (xo, Yo, Zo) is given by 
O- P(E) = Steg, + eds, + HO g 


where 


£(to) = dx Onl. Ordo 
~% \dt},, Gudt ` évdt 
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evaluated at (xo, Yo, Zo) and t = fg. Substituting and collecting terms, we 
have 


d du({éx ð éyd zð 
ve Pada) ~ dt & ax | Gudy ` du 3 

dv (0x ô eo gyae 

dt\dvéx évéy évéz 


= U(to(Xuo + Oto NX, )o- 


If we let u = t, v = vo, we obtain just (X,)o = 6,(6/du) and analogously 
(X,,)o is tangent to the parameter curve u = ug, v = t. The coordinate frame 
vectors are tangent to the coordinate curves. 

This could also be derived directly from the relation (0° F), = 0, ° Fy 
of Theorem 1.2. This means that the (tangent) velocity to every curve in M 
through p = (Xo, yo, Zo) lies in the subspace T,(M) = T,(R*) spanned by 
(X,,)o and (X,)o. Conversely by suitable choice of the curve every vector of 
T,(M) may be so represented. 


Exercises 


1. Let F, be the family of C” functions f on open sets W, of M which 
contain the point p. Define a relation ~ on ¥, by f~giff=gona 
neighborhood of p. Show that ~ is an equivalence relation and that the 
equivalence classes, called germs of C” functions at p, form an algebra 
C(p) with unit over R. 

2. Let F:M—>N be a C” mapping of manifolds. Show that 

f— F*(f) = F o f defines a homomorphism F*: C*(F(p)) > C*(p) and 

prove the statements of Theorem 1.2 about F*. 

Prove Corollary 1.8. 

4. For peM let @ be the collection of all coordinate neighborhoods con- 
taining p. Let (a,,...,%)y,, and (f;,.-., B,)g,g be objects consisting of 
an element of R” together with—or labeled by—a coordinate neighbor- 
hood of €. They will be called equivalent if they correspond by the 
formulas of Corollary 1.8. Show that this is an equivalence relation and 
that the classes form a vector space naturally isomorphic to T,(M). 

5. Using the notation of Example 1.10, show that for any a, Be R there is a 
parametrized curve on M through p whose velocity vector is exactly 
a(Xu)o + B(Xa)o- 

6. If the surface of Example 1.10 is given in the form z = h(x, y) with 
Zo = h(x, Yo), then show as a special case of our discussion that with 
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suitable parametrization the tangent plane Tixo, yo, -9)(M) consists of all 
vectors from (Xo, Yo, Zo) to points (x, y, z) satisfying 


(=), — Xo) + (5),0 aeons 


7. Let N c M bearegular submanifold and U, ọ bea preferred coordinate 
neighborhood relative to N with local coordinates (x’,..., x") and 
frames (E,,..., Em). If NU is given by x"*! =-~ = x” = 0, show 
that E,,,..., Enp is a basis of T,(N) for every pe N a U. Modify this 


statement so as to include immersed submanifolds. 


2 Vector Fields 


In a previous paragraph (Definition 1.1), we defined the notion of a 
tangent vector to a manifold at a point pe M, that is, an element X, of 
T,(M). In this section we will define and give examples of a C’-vector field on 
M,r > 0. A vector field X on M is, first of all, a “function” assigning to each 
point p of M an element X, of T,(M) (see Fig. IV.4). We place the word 
“function” in quotation marks since we have not really defined its range, 
only its domain M. The range is, in fact, the set nas consisting of all 


tangent vectors at all points of M, T(M) = |]pem T,(M 


Figure IV.4 
Vector field X on M. 
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For future reference we note some properties of T(M). It is a set that is partitioned into 
disjoint subsets {T,(M)} which are indexed by the points of M, that is, to pe M corresponds its 
tangent space T,(M). It follows that there is a natural projection x: T(M) >M taking the 
vector X „e T(M) to p. The vector field X as a function X: M > T(M), must satisfy the condi- 
tion z- X = ių, the identity on M. Further details are given in Exercises 5-7. 


Second, a vector field X is required to satisfy some condition of regular- 
ity, that is, of continuity or differentiability. We impose this as follows: For 
peM ict U, @ be any coordinate neighborhood of p, and let E,,,..., Enp be 
the corresponding basis (coordinate frames) of T,(M). Then X,, the value of 
X at p, may be written uniquely as X , = vey a'E;,. If p is varied in U, the 
components «,..., «” are well-defined functions of p which must, then, be 
given by functions of the local coordinates (denoted by the same letters) 


ai = at(xt,..., x")  i=1,...,7, on ọ(U)c R". 


We say that X is of class C", r > 0, if these functions are of class C” on U for 
every local coordinate system U,œ. Since the expressions given in 
Corollary 1.8 (see also Exercise 4) are linear with C” coefficients, we see that 
this definition is independent of the coordinates used. (Note that we include 
the case r = 0 of continuous components.) Collecting these requirements 
leads to the precise definition: 


(2.1) Definition A vector field X of class C’ on M is a function assigning to 
each point p of M a vector X,€ T,(M) whose components in the frames of 
any local coordinates U, ọ are functions of class C” on the domain U of the 
coordinates. Unless otherwise noted we will use vector field to mean 
C*-vector field hereafter. 


We remark that this definition is somewhat awkward, especially as re- 
gards the regularity condition; our treatment places reliance on local coor- 
dinates. One way to avoid this is to define X to be C” if for every C” function 
f whose domain W, is an open subset of U, the function Xf, defined by 
(Xf )(p) = X,f, is of class C”. Another very elegant approach is to give T(M) 
the structure of a C” manifold and then X becomes a mapping, X: M > 
T(M), of one C” manifold to another. In this case we have already defined 
the meaning of C” in Definition II1.3.3. We shall develop these important 
ideas in the exercises. 


(2.2) Example If we consider M = R? — {0}, then the gravitational field 
of an object of unit mass at 0 is a C%-vector field whose components 
a, «7, «°? relative to the basis é/éx! = Ey, 6/éx? = E,, and @/éx? = E, are 
d=, §=1,2,3 with r= (x!) + (x2? + OPP). 

r 
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(2.3) Example Given any coordinate neighborhood U, g on a manifold 
M, then U, being an open set of a manifold, is itself a manifold of the same 
dimension, say n. The vector fields E; = ø% '(6/0x'), i = 1, ..., n, have com- 
ponents a/ = ôj. These are constants and hence C% functions on U, so that 
each E; is a C®-vector field on U. The set E,,..., E, is a basis of T,(M) at 
each pe U, the coordinate frames (Fig. IV.5). 


More generally, a set of k vector fields on a manifold M, dim M = n, 
which is linearly independent at each point is called a field of k-frames on M. 
If k = n, then the frames form a basis at each point. Of course, it would be 
convenient if on a manifold one could always find such a field of n-frames, 


Figure IV.5 
Coordinate frames on U c M. 


for then the components of any vector field would be globally defined, that 
is, functions whose domain is all of M. This would relieve us of the necessity 
of using local coordinate neighborhoods and the associated frames 
E,,..., E,. However, it is known that this is not possible in general, for 
example, on the sphere S? it is not possible to define even one continuous 
vector field X which is linearly independent (nonzero) at each point of S°. 
This a classical theorem of algebraic topology discovered by Brouwer; it will 
be proved in Section V1.8. We shall give some further related examples for 
which we need the following lemma: 


(2.4) Lemma Let N be a submanifold of M and let X be a C®-vector field on 
M such that for each pe N, X „€ TN). Then X restricted to N is a C®-vector 
field on N (called the restriction of X to N). 


PDF pdfFactory Pro www.fineprint.cn 


2 VECTOR FIELDS 119 


Proof By hypothesis X assigns to each pe N the tangent vector X, in 
the subspace T,(N) of T,(M). We must prove that X restricted to N is of class 
C”. By choosing suitable cubical coordinate neighborhoods V, y and U, of 
peN in N and in M, respectively (cf. III.4.13 and II1.5.6), we may assume that 
the immersion (inclusion) is given locally by i:(y*,..., y > (1, -3 y" 
0,...,0). Thus V is a slice through p of U and Yy =zn°ọ|V, z denoting 
projection to the first n coordinates. We have y~' = g~* ci and hence for 
qe V, vw, (6/dy) = px '(@/6x,), so the coordinate frames at qe V are natur- 
ally identified with first n vectors E,,,..., Eng ofthe coordinate frames on U, 
E,, = 0x '(6/6x;). Now if on U, the C” vector field X, = }'s-1 #(Q)Ejq, then 
for qe V, the last m — n components vanish, since for qe V, X,€ T(N) by 
hypothesis. Thus, restricted to V, X, = yr, 4(q)Eig, where (4), .--, %(4) 
are the restrictions to the submanifold V of U and therefore C” on V. This 
shows that the restriction to N of X is C” in a neighborhood V of any pe N 
and is thus C™ on N. 


(2.5) Example Although no nonvanishing continuous vector field exists 
on the 2-sphere S?, there are three mutually perpendicular unit vector fields 
on S? c R$, that is, a frame field. Let S? = {(x!, x?, x3, x*)| YH, (x'? = 1) 
and let the vector fields be given by 


fa] ô 0 ô 
Z _ a20 1 4 3 
<= Ma age Bags Oe? 

ô ô ô ô 
= 3 _ a? 10 0 
Laena at ae ax 

ô fa) 0 é 
_ _,4 3 2 1 
a ga axe * axt X Jx 


at the point x = (x!, x?, x3, x*) of S°. Since at each point these are mutually 
orthogonal unit vectors in R*, they are independent. To see that they are 
tangent to S? it is enough to see that they are orthogonal to the radius vector 
from the origin 0 to the point x of S*; this is easy to check. There remains 
only the question of whether they are C®-vector fields. However, this is an 
immediate consequence of the preceding lemma with N = S° and M = Rt. 

It is possible to show that all odd-dimensional spheres have at least one 
nonvanishing C®-vector field and that—like S?—no even-dimensional 
sphere has any continuous nonvanishing field of tangent vectors. It has 
recently been proved that only the spheres S', S°, S7 have a C” field of bases 
as we have just seen to be the case for S*. Manifolds with this very special 
property are called parallelizable. As already mentioned, coordinate neigh- 
borhoods are parallelizable. 


Having established the concept of vector field on a manifold, we must 
now consider what happens when we map a manifold N on which a vector 
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field is defined into another manifold M. In Section 1 we saw that if 
F:N >M is a C” map, then to each point pe M there is associated a 
homomorphism F,: T,(N) > Trg(M). If X is a vector field on N, then 
F ,(X,,) is a vector at F(p). However, this process does not in general induce a 
vector field on M for various reasons: first, F(N) may not be all of M, that is, 
given qe M it may well happen that for no pe N is F(p) = q. Second, even if 
F~'(q) is not empty, it may contain more than one element, say pı, pa with 
pı # Pz, and then it may happen that F,(X,,) # F (X p) so that there is no 
uniquely determined vector Y, at q which is the image of vectors of the field 
X on N. It is easy to construct examples of these mishaps, for instance, let N 
be the half-space x! > 0 in R? and F: N > M be projection to the coordin- 
ate plane x? = 0. If X is the gravitational field of Example 2.2 restricted to 
N, we see that the image vectors do not determine a vector field on M. 


(2.6) Definition If, using the notation above, we have a vector field Y on 
M such that for each ge M and pe F~'(q) = N we have F,(X,) = Y,, then 
we say that the vector fields X and Y are F-related and we write, briefly, 
Y = F,(X). [We do not require F to be onto: If F~'(q) is empty, then the 
condition is vacuously satisfied.] 


(2.7) Theorem If F: N >M is a diffeomorphism, then each vector field X 
on N is F-related to a uniquely determined vector field Y on M. 


Proof Since F is a diffeomorphism, it has an inverse G: M > N, and at 
each point p we have F,: T,(N) > Tr (M) is an isomorphism onto with G, 
as inverse. Thus given a C®-vector field X on N, then at each point q of M, 
the vector Y, = F,(X Gq) is uniquely determined. It then remains to check 
that Y is a C®-vector field. This is immediate if we introduce local coordin- 
ates and apply Theorem 1.6 to the component functions. | 


We remark that under the hypotheses of Lemma 2.4 we have a second 
example of F-related vector fields: Let F: N > M be the inclusion map and 
let X’ be X restricted to N. Then X’ and X are F-related by the lemma. 
Further examples of F-related vector fields arise from the study of Lie 
groups. 


(2.8) Definition If F: M — M is a diffeomorphism and X is a C™ vector 
field on M such that F,(X) = X, that is, X is F-related to itself, then X is 
said to be invariant with respect to F, or F-invariant. 


(2.9) Theorem Let G be a Lie group and T(G) the tangent space at the 


identity. Then each X€ T,(G) determines uniquely a C®-vector field X on G` 
which is invariant under left translations. In particular, G is parallelizable. 
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Proof To each ge G there corresponds exactly one left translation L, 
taking e to g. Therefore if it exists, X is uniquely determined by the formula: 
X, = L,,(X,). Except for differentiability, this formula does define a left 
invariant vector field since for ae G, we have L,,(X4) = Lay © Lgy(Xe) = 
Lagx(Xe) = Xag- We must show that X, so determined, is C”. Let U, ọ bea 
coordinate neighborhood of e such that g(e) = (0,...,0) and let V be a 
neighborhood of e satisfying VV c U. Let g,he V with coordinates 
x = (x',..., x”) and y = (y',..., y"), respectively, and let z = (z',..., z”) be 
the coordinates of the product gh. Then z' = f'(x, y), i= 1, ..., n, are C” 
functions on g(V) x (V). If we write X, = )"_, yE,., y',..., y” real num- 
bers, then according to Theorem 1.6 the formula above for X, becomes 


.{ of! 
since in local coordinates L, is given by z‘ = f'(x, y), i= 1,..., n, with the 
coordinates x of g fixed. It follows that on V the components of X, in the 
coordinate frames are C® functions of the local coordinates. However, for 
any aéG the open set aV is the diffeomorphic image by L, of V. Moreover 
X, as noted above, is L,-invariant so that for every g = aheaV we have 
X, = L,,(X,,). It follows that X on aV is L,-related to X on V and therefore 
X is C* on aV by Theorem 2.7. Since X is C” in a neighborhood of each 
element of G, it is C” on G. | 


(2.10) Corollary Let G, and G, be Lie groups and F: G, + G, a homomor- 
phism. Then to each left-invariant vector field X on G, there is a uniquely 
determined left-invariant vector field Y on G, which is F-related to X. 


Proof By Theorem 2.9, X is determined by X,,, its value at the identity 
e, of G,. Let e, = F(e,) be the identity of G, and let Y be the uniquely 
determined left-invariant vector field on G, such that Y,, = F,,(X,.,). That Y 
should have this value at e, is surely a necessary condition for Y to be 
F-related to X ; and it remains only to see whether this vector field Y satisfies 
F,(X,) = Yro for every ge G,. If so, Y is indeed F-related (and uniquely 
determined). We write the mapping F as a composition F = Ly) ° F ° L,-1, 
using F(x) = F(g)F(g~‘x), and note that since both X and Y are left- 
invariant by assumption, this gives 


F (X,) = Lew © Fy ° Lg-14(Xq), 
F,(X,) = Lro ° F(X e) = Lee Yei > 
F(X) = Yro- 
Therefore Y meets all conditions and the corollary is true. i 
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Exercises 


1. Show that a function X assigning to each peM an element of 
T,(M)—as in Definition 2.1—is C® if and only if whenever f is a C?” 
function on an open set W, of M, then Xf, defined by (Xf )(p) = X „f; is 
C” on W,. 

2. Show that a C®-vector field X on M defines a derivation on C*(M) by 
f— Xf as defined in Exercise 1. 

3. Show that the derivations of C°(M), M a C® manifold, are ina natural 
one-to-one correspondence with ¥(M) the collection of all C”-vector 
fields on M. 

4. Show that the collection ¥ (M) of all C*-vector fields on M, is closed 
under addition and multiplication by C® functions [both defined poin- 
twise: (X + Y), = X, + Y, and (JX), = f(p)X;,]. 

5. Define a C* structure of a manifold on T(M) in sucha manner that for 
each coordinate no U, ọ on M, with local coordinates (x' < X") 
and frames E,,...,E,, the set U = 2 '(U) with mapping G Ŭ > 
R” = R" x R" a as follows is a coordinate oe. ‘oh 
pe U, X,€U, we suppose X, = = 2i- , wE;, and define Ẹ(X ,) = (x'(p 

e X"(p); a, , a") = (o(p) ney 

6. Using Bxercise 5, show that n: Ton — M is C”, a submersion, and that 
T,(M) = n~ *(p) is a submanifold of T(M). 

7. Using Exercise 5, show that the C%-vector fields on M correspond 
precisely to the C” mappings X: M > T(M) satisfying z ° X = iy , the 
identity map on M [x(X,) = p)]. 

8. Suppose F: N > M is C®. Then, in the notation of Exercise 7, show that 
F,: T(N) > T(M) is C” and me Fẹ = Fen. 

9. Show that if F: N > M is C® and X is a C*-vector field on N, then an 
F-related vector field Y on M, if it exists, is uniquely determined if and 
only if F(N) is dense in M. Note that this applies in particular to vector 
fields on submanifolds, i.e., for F: N > M a one-to-one immersion. 

10. Show that the restriction of 


on R?” to S2"-! defines a nonvanishing C®-vector field on S?”~. 

ll. Let F: M >M be a C® covering and Y any C%-vector field on M. 
Show that there is a unique C”-vector field X on M such that X and Y 
are F-related. 

12. Show that any C®-vector field Y on S"~' c R” is the restriction of a 
C”-vector field X on R”. 
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13. Given a C® mapping F: N > M and C®-vector fields X on N and Y 
on M, show that Y is F-related to X if and only if for any C® function g 
on M we have (Yg)° F = X(go F) on the inverse image F~ *(W,) of the 
domain W, of g. 


3  One-Parameter and Local One-Parameter Groups Acting 
on a Manifold 


We shall now subject the case of a connected Lie group of dimension 1 
acting on a manifold M to the same scrutiny as we did the case of a Lie 
group of dimension 0 in Section III.8, but with very different emphasis. At 
that time we were interested in the space of orbits; in the present instance we 
are mainly concerned with the relation to vector fields on M. For this reason 
we shall limit ourselves to the action of R, by which we denote the additive 
(Lie) group of real numbers R, acting on M since this will illustrate all the 
relevant facts—it can be shown that R and S' are the only connected Lie 
groups of dimension 1. These two cases, discrete Lie groups and the one- 
dimensional Lie group R acting on M, will give some idea of the depth and 
diversity of the whole subject of group action on manifolds. Later we shall 
have something to say about another special case: transitive action of a Lie 
group G on a manifold. 

Thus, in the present section, we consider Definition III.7.1 specialized to 
an action 8 of R on M. Let 0: R x M > M bea C” mapping which satisfies 
the two conditions: 

(i) @o(p) = p for all pe M, 

(ii) 0,° 0,(p) = 9,+(p) = 8, ° 9,(p) for all pe M and s, te R. 

[We will often write 6(t, p) as 0,(p) or 8,(t), depending on which variable is to 
be emphasized. ] 


(3.1) Example Suppose that M = R° and a = (a', a?, a°) is fixed and 
assumed different from 0. Then 0,(x) = (x! + a't, x? +.a7t, x? + @°t) 
defines a C” action of R on M. To each te R we have thus assigned the 
translation 0,: R? > R?, taking the point x to the point x + ta. This is a free 
action and the orbits consist of straight lines parallel to the vector a. A 
particularly simple special case is given by a=(1,0,0) so that 
0,(x) = (x! + t, x?, x’). 


Suppose that 6: R x M — M is any such C® action. Then it defines on 
M a C®-vector field X, which we shall call the infinitesimal generator of 0, 
according to the following prescription: For each peM we define 
Xp: C°(p) > R by 


(2) x, f= lim <-L/@ai()) -SO 
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We may check directly from (3.2) that X, is a vector at p in the sense of 
Definition 1.1, and then verify that p > X, defines a vector field, or we may 
proceed as follows. Let U, g be a coordinate neighborhood of pe M and let 
I; x V be an open subset of (0,p) in Rx M, where I; = fteR| 
— ô < t < ô}, and V and 6 > Oare so chosen that 0([; x V) c U. In particu- 
lar, V = 8,(V) is contained in U and contains p. Restricted to the open set 
I; x V, we may write @ in local coordinates 


y" = h”(t, xt, ..., xX”), 


or y= h(t, x), where x = (x!,..., x") are the coordinates of qe V and 
y = (yt, ..., y") of 8,(q), its image. The h’ are defined and C” on I, x (V) 
and the range of h(t, x) is in g(U). The fact that 05 is the identity and 
O+: = 9, © Qp, is reflected in the conditions: 


nO,x)=x' and A(t, +], x) = hi'(ty, h(tz , x)) 


fori = 1,...,n. Now iff (xt, ..., x")is the local expression for fe C” (p), then 


aC) -SO = 4, A) — Fl] 
and 
Kp f= lim Aa- SOE) 


where we have used a dot to indicate differentiation with respect to t. This 
formula is valid for every pe V and implies that on V, X, = > Ah'(O, x)E,, 
with E; = , '(6/0x') and x = (p), which shows that X is a C®-vector field 
over V. Since every point of M lies in such a neighborhood, X is C” on M. 
Note that definition of X at pe M involves only the values of 9 on J; x V, 
that is, like derivatives in general, it is defined locally and involves only 
values of t near t = 0. 


(3.3) Definition If @:G x M — M is the action of a group G on a mani- 
fold M, then a vector field X on M is said to be invariant under the action of G 
or G-invariant if X is invariant under each of the diffeomorphisms @, of M to 
itself, in brief if 6,,(X) = X (as in Definition 2.8). 


(3.4) Theorem If 0:R xM ->M is a C” action of R on M, then the 


infinitesimal generator X is invariant under this action, that is, 0,4(X >) = X exp) 
for allteR. 
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Proof Let fe C*(0,(p)) for some (t, p)e R x M and compute 0,,(X,)f: 


Bal K P= X f= 0) = lim y, Lf 0:0a) = f= Ao] 


However, R is Abelian and we have 0, o 04, = 8:44, = Oar £ 0, SO 


za d 
Diy (X p) = nmi a? 64)(8,(P)) — f(6,(p))] = Xop S. 
At> 
Since this holds for all f, the result follows. i 


(3.5) Corollary If X,, = 0, then for each q in the orbit of p we have X, = 0, 
that is, at the points of an orbit the associated vector field vanishes identically 
or is never zero. 


Proof The orbit of p consists of all q such that q = 0,(p) for some te R; 
thus by the theorem X, = 0,, Xp. Since 0, is a diffeomorphism, we know 
that 9,, is an isomorphism of T,(M) onto 7,(M)so that X, = 0 if and only if 
X, =0. | 


(3.6) Theorem The orbit of p is either a single point or an immersion of R in 
M by the map t > 0,(p), depending on whether or not X, = 0. 


Proof The orbit of p is the image of R under the C” map t — 6,(p) into 
M. Denote this map by F so that F(t) = 0,(p). Let tp € R and d/dt denote the 
standard basis of T, (R); F is an immersion if and only if F „(d/dt) + 0 for 
every toe R. Let fe C*(F(to)) = C%(6,,(p)) and observe that 


d d ee: 
F(a) fo a F),, = lim alt F(to + At) — f ° F(to)] 


II 


lim 5 LA (Or alp) — £(9,.(0))] 


= Xon S 
by precisely the same arguments as we used to prove Theorem 3.4. This 
formula and Corollary 3.5 show that either X, # 0 and F is an immersion 
or else Xpa = F,(d/dt) = 0, in which case F is a constant map with 
F(R) = p. For the proof of this last statement see Exercise 1. | 


We remark that the formula just obtained, namely, 


d 
F(a) = Xos) = X Fto > 


shows that at each point pe M the vector X, is tangent to its orbit and in 
fact is the (tangent) velocity vector of the curve t > F(t) in M in the sense in 
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which we have previously (Example 1.9) defined the velocity vector to a 
parametrized curve, that is, to a differentiable map of an open interval J of R 
into M, namely F,(d/dt). This latter notation is not too precise since it does 
not indicate that d/dt € T,,(R) and that F, is a homomorphism of this one- 
dimensional vector space into Tpy,,)(M). For this reason we often will write 
either F (to) or (dF /dt),, to denote the velocity vector. Sometimes it is conven- 
ient to let t > p(t) denote the mapping rather than F. Then its velocity vector 
is written dp/dt or p(t). For example, in the notation of Theorem 3.6, the 
formula above can be written O(t, p) = Xau, p- 

If we change parameter by a function t = f(s) so that s > G(s) = F(f(s)) 
gives the curve, then for to = f (So), 


dG d d dt d 
(i) = Se) = ela) = (isa) 


which give the formula 
d 
dey (a) p(4) 
ds } s ds) so “\dt} 1 


Thus the velocity vector with respect to s is a scalar multiple by (dt/ds),, of 
the velocity vector with respect to t. This may be conveniently written 


; . d 
B) G= (5) È) or H "a 


This vector equation is, of course, just a special case of the chain rule. 


(3.8) Definition Given a vector field X on a manifold M, we shall say that 
a curve t > F(t) defined on an open interval J of R is an integral curve of X if 
dF /dt = Xpo on J. By definition, an integral curve is connected. 


We have just shown that each orbit of the action @ is an integral curve of 
X, the infinitesimal generator of 0, that is, for each fixed pe M, 
A(t, p) = Xou, p 

At this point some natural questions arise concerning vector fields and 
one-parameter group actions: Is every C®-vector field the infinitesimal gen- 
erator of some group action? Can two different actions of R on M give rise 
to the same vector field X as infinitesimal generator? These questions will be 
answered in this and the next section. However, a simple but instructive 
example will illustrate the difficulties we face and show the necessity fora 
less restrictive concept of one-parameter group action. 


(3.9) Example Let M = R? and let 6:R x MM be defined by 
O(t, (x, y)) = (x + t, y). Then the infinitesimal generator is X = 0/0x. This 
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action is given by translation of each point (x, y) to a point t units to the 
right. Suppose now that we remove the origin (0,0) from R?; let 
My = R? — {(0, 0)}. For most points 0, is defined as before; however, we 
cannot obtain an action of R on Mg by restriction of 0 to R x Mo since 
points of the closed set F = {(t, (x,0))|t + x = 0} = 07 +(0, 0) of R x M 
are mapped by 0 to the origin. Let W c R x Mg be the open set 


W = ( U Rx {x v) ) v( Y tre RIG +t) > 0} x {(x, 0) 


y#0 


Then 0 = 6|W maps W onto M, and preserves many of the features of 0 
which we have used. For example, let p = (x, y)€ Mo, then 


(i) (0, p)e W and 8o(p) = p, 

(ii) 9, ° 8,(p) = 0,.(p) = 0, ° 8,(p) 
if all terms are defined. The infinitesimal generator X is defined by (3.2) 
just as before and is again X = @/@x. Finally we have orbits t > 6,(p), which 
are the lines y = constant as before when p = (x, y), y #0, and for 
p = (x, 0) the portion of the x-axis minus the origin which contains p. This 
curve is not defined for all values of ¢ in the case of the orbit ofa point on the 
x-axis. Exercise 10 gives an important example of a different type. A careful 
study of these examples will motivate the following complicated definition. 


First let M be a C” manifold and W c R x M be an open set which 
satisfies the following condition: 


(3.10) For every pe M there exist real numbers a(p) < 0 < f(p) such that 
Wo (R x {p}) = {(t, p) | (Pp) < £ < Alp)}. 


We shall denote by /(p) the interval «(p) < t < f(p) and by J, the inter- 
val defined by |t| <6. Condition (3.10) simply states that 
W = \)oem I(p) x {p}. Note that W is connected. Using this notation and 
with W as above we make the following definition: 


(3.11) Definition A local one-parameter group action or flow on a manifold 
M is a C* map 0: W > M which satisfies the following two conditions: 
(i) o(p) = p for all pe M. 

(ii) If (s,p)eW, then a(0,(p)) = a(p) — s, B(0,(p)) = Alp) — s, and 
moreover for any t such that a(p) — s < t < B(p) — s, 9,,,(p) is defined and 
0, < Op) = 0,+ (p). 

It is easy to check that the Example 3.9 given above has these properties. 
This example also shows that if we are to have any prospect of obtaining a 


correspondence between one-parameter group actions and vector fields, we 
must abandon the requirement that W is ali of R x M, which we shall calla 
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global action. Since W is open and contains (0, p) for each pe M, it also 
contains I, x U, U a neighborhood of p, for sufficiently small 6 > 0. There- 
fore the definition of the vector field X (infinitesimal generator) associated 
with @ as given by (3.2) is valid in the case of local action also and associates 
a C*-vector field to each flow 6. 

When R acts on M, as in the case of any group acting on M, for each t, 
0,: M > M isa diffeomorphism with 6; ' = 0_,. Something like this is also 
true for the local case of Definition 3.11, except that 6, is not defined on all of 
M in general. Let V,c M be the domain of definition of 6,, that is, 
V, = {pe M|(t,p)e W}; then we have the following consequence of 
Definition 3.11: 
(3.12) Theorem J, is an open set for every te Rand 0,: V, > V_, is a diffeo- 
morphism with 0, ' = 0 

Proof Let poe V,,, so that (to. po)e W. Since W is open, there is a 
ô > 0 and a neighborhood V of po such that {t| |t — ty| <6} x Vc W. 
In particular, {tg} x VcW so that V c V., so V, is open. Next, by 
Definition 3.11 (ii), if pe V,, then a(p) < t < f(p) and by (3.10) t + (—2) lies 
in the same interval. It follows that 0,(p)e V_, and 6_, ¢ 0,(p) = p. Similarly, 
6_(V_,) € V, and 6,° 0_,(q) = q for any ge V_,. Combining these state- 
ments with the fact that 6,,@_, are C% on any open subsets of M on which 
they are defined completes the proof. | 


-1t- 


(3.13) Remark For local one-parameter action we may show as in the 
global case that: 6,,(X p) = Xoyp if pe V,. As before, F(t) = 6,(p) defined for 
a(p) < t < f(p) is a C*-integral curve of X, which is an immersion of I(p) in 
M provided that X, # 0 and is a single point if X, = 0. We shall continue 
to refer to these curves as orbits of the local one-parameter group, just as in 
the global case. It is a consequence of our definitions that these curves (and 
points) partition M into a union of mutually disjoint sets. The proofs are the 
same, essentially, as in the global case. 


Finally, we wish to prove that in a neighborhood of any p for which 
X, #0, Example 3.9 is a prototype for every local (or global) one- 
parameter group action on a manifold. 


(3.14) Theorem Let 0: W —> M be as in Definition 3.11 and let X be the 
associated infinitesimal generator. If pe M such that X, + 0, then there is.a 
coordinate neighborhood V,w around p, a v > 0, and a corresponding neigh- 
borhood V’ of p, V' < V, such that in local coordinates 0 restricted tol, x V' 
is given by 

(yo WV) Y! + ty... y") 


In these coordinates X = W,,'(0/6y') at every point of V’. 
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Proof We shall use the notation and formula for X, developed in the 
discussion of (3.2). In W introduce coordinates U, p around p and express 0 
in the local coordinates by x > A(t; x), where x = (x',..., x") and A(t; x) 
stands for an n-tuple of functions satisfying: (i) h(0;x) =x, and 
(ii) h(t; h(t’; x)) = A(t + t’; x). We will assume coordinates so chosen that 
o(p) = (0,...,0), that g(U)= C0), and that X, = p,'(0/éx') = Eip. 
Then the expression for X,, X, =>, h'(0;0,...,0)E,,, implies that 
h'(0;0,...,0) is 1 for i = 1 and is 0 for i > 1. 

Choose 6>0 small enough so that V” = p~'(C3(0))c U and 
e(d, x V”) © U. Then map the cube C3(0) c I; x R"~' into e(U) by a map F 
defined by 

PO aie y") > (h (y1;0, oS nh 9, ere a} 
Since X, =F h(0;0)E;, = Ep (Gh'/dy')o = 54; and from (ii), y'= 
h'(0;0, y?,.-.,y"), (Ghi/Oy/), = 5; for j > 1. Thus the Jacobian of F at y = 
(0, ..., 0) is the identity matrix; hence there is a y > 0 with u < 6 such that F 
is a diffeomorphism of C%(0) onto an open set of C30) = (U). Let 
V = ` ' o F(C}(0)) and y = F~! o ọ; this is a coordinate neighborhood of p 
with V c U. 

The relations satisfied by h'(t, x), i = 1, ..., n, give 

(i) yip) = F~ '(ọ(p)) = F710, ...,0) 
and for (y',..., y")e C,(0) and |t| < v with v = 4/2 they give 

(ii) hie + yt; 0, 7,0... y") = h(t, hly; 0, 7,7) t= Lt 
Formula (ii) may be interpreted as follows: In the coordinate system (V, ), 
if W(q) = (y!,-.., y"), then W(0,(q)) = (t + y’,.-..9”), provided only that 
|t| <v and gew ‘(C%(0)), so that all functions are defined. In other 
words, in the y-coordinates of V, y, the mapping 0, is expressed by functions 
hi(t, y) defined on I, x C2(0) by 


h(t, yt... y") = t+ y, 
hit,y,..,7)=y! for i>. 
From these and the formula 
m ô 
W(X) = È k'(0, y) 3y = 


we have X, = w, ‘(d/éy') on V’ = y~ +(C3(0)). | 


ĉ 


ôy! 


> 


Exercises 


1. Let F: M > N bea C” mapping whose rank is everywhere zero (that is, 
F, = 0 at each pe M). Show that F maps each component of M into a 
single point. 
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2. Let F: M > N be a C” mapping and X and Y vector fields on M and N, 
respectively, which are F-related. Show that any integral curve of X is 
mapped by F into an integral curve of Y. 

3. Give a proof without using local coordinates that the infinitesimal gen- 
erator X of 0 as defined in (3.2) is a C®-vector field on M. [Hint: Use 
Exercise 2.1]. 

4. Verify the statements of Remark 3.13. 

5. Let R act on M = R? according to the formulas 


x> xcost + ysint, yo —xsint + y cost, 


which give 0,(x, y). Show that this is a globally defined group action of R 
on M and find X, the infinitesimal generator. What are the orbits? 
6. Show that (x, y) > O(t; x, y), defined by 


x> xe”, yoye* 


defines a C® action of R on M = R?. Determine the infinitesimal 


generator and show that it is 0-invariant. 
7. Let M = GI(2, R) and define an action of R on M by the formula 


A(t, 4)=(j Ja A € GI(2, R), 


with the dot denoting matrix multiplication. Find the infinitesimal 
generator. 

8. Let X = }}-, f'(x)(@/éx') be a C*-vector field on R” which generates 
an action @ on R”. Suppose @ to be given on its domain W by O(t, x) = 
(h'(t, x), .-., h”(t,x)) and suppose a = (a',...,a")e R". Then using 
Remark 3.13 show that x! = hi(t, a), i = 1,...,n, are solutions of the 
differential equations 


satisfying x' = af, i = 1, ..., n, when t = 0. Verify this for Exercises 6-8. 

9. Suppose that a C” action of R on M, M = R? has infinitesimal genera- 

tor X = x d/éx + y ĝ/ðy on M. Determine 0. [Hint: Try to find func- 

tions h'(t,x, y) and h(t, x, y) by solving the system of ordinary 

differential equations dx/dt = x. dy/dt = y with initial conditions 
x(0) = a, y(0) = b as in Exercise 8.] 

10. Let X = x? 6/dx define a vector field on M = R. Using the method of 
Exercise 8. find @ and its domain W. 
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4 The Existence Theorem for Ordinary Differential Equations 


In this section we state a very basic theorem of analysis which we need in 
order to answer some of the questions raised in the previous section and 
which will be applied in essential ways throughout the remainder of the 
book. 


(4.1) Theorem (Existence theorem for ordinary differential equations) 
Let U c R" be an open set and I,, e > 0, denote the interval —e < t < e, 
te R. Suppose fi(t, x}, ..., x"), i = 1,...,n, to be functions of class C", r > 1, 
on l, x U. 

Then for each x € U there exists 6 > O and a neighborhood V of x, V < U, 
such that: 


(I) For each a = (a’,...,a")€ V there exists an n-tuple of C’*? functions 
x(t) = (x"(t), ..., x"(t)), defined on I, and mapping 1, into U, which satisfy the 
system of first-order differential equations 

dxi 
(*) S = f'(i x), i=1,...,n, 


and the initial conditions 
(xx) x(0)= a,  i=1,...,” 


For each a the functions x(t) = (x'(t), ..., x"(t)) are uniquely determined 
in the sense that any other functions X(t), ..., X"(t) satisfying (*) and (#*) must 
agree with x(t) on an open interval around t = 0. 


(II) These functions being uniquely determined by a = (a',..., a") for 
every ae V, we write them x'(t, a',..., a"), i= 1,..., n, in which case they are 
of class C" in all variables and thus determine a C" map of Is x V > U. 


A proof of (I), which uses the contracting mapping lemma is given in an 
Appendix to this section (see p.172). The proof of (II) is more difficult and 
may be found in the work of Hurewicz [1], Dieudonné [1], or Lang [1]. 

If the right-hand side of (+) is independent of t, then the system of 
differential equations is called autonomous. Throughout the remainder of 
this chapter we shall deal only with autonomous systems. In this case it is 
possible to restate the hypotheses and conclusions of the fundamental exist- 
ence theorem in coordinate-free form using the concepts of vector field and 
integral curve. This will allow us to derive various global theorems useful in 
both geometry and analysis from a purely local existence theorem about 
open subsets of R”. 

We first reinterpret the existence theorem in the autonomous case, in 
which the functions f? depend on x = (x’,..., x") alone. For simplicity we 
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shall also assume hereafter that all data are C®. Define on U c R" a 
C®-vector field X by 
ô al 
— fi ae n 
X =f'(x)za + + IX) aa 


By Definition 3.8 an integral curve of X is a C” mapping F of an open 
interval (a, £) of R into U such that F(t) = Xpq for alla < t < B. If we write 
F in terms of its coordinate functions 


F(t) = (x"(0), ..., x*(0)), 


then the vector equation F(t) = Xpy is satisfied if and only if 


dx? og 
i = fixt). A) i= Len 
which states precisely that the functions x(t) = (x! (t), ..., x"(t)) are a solu- 


tion of (+). Given xe U, (I) of Theorem 4.1 states that for each a in a 
neighborhood V of x there is a unique integral curve F(t) (see Fig. IV.6), 
satisfying F(0) = a. F(t) is defined at least for —ô < t < 6 where ô > Ois the 
same for every ae V. If we use a notation for these integral curves through 
points of V which indicates dependence on the initial point a, say 


F(t, a) = (x` (t, a), ..., x"(t, a)), 


and use an overdot for differentiation with respect to t, these equations 
become 


x(t,a)=f'(x(t,a)) and xO,a)=a, i=1,...,n. 


Figure IV.6 
Integral curves of a vector field. 
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Part (II) of Theorem 4.1 states that these functions x(t, a) are C”—in all 
variables—on I, x V, an open subset of R x U. 

As an aid to intuition we may interpret the mapping F: I; x V > U asa 
“flow,” that is, a motion within U of the points of V so that the point at 
position a at time t = 0 moves to F(t, a) at time t (see Fig. [V.7). The path of 
a moving point is the integral curve, and its velocity at any of its positions is 
given by the vector X assigned to that point of U. 

We now turn to a vector field X on an arbitrary manifold, considering 
first a purely local question. 


Figure IV.7 


(4.2) Theorem Let X be a C*-vector field on a manifold M. Then for each 
pe M there exists a neighborhood V and real number 6 > 0 such that there 
corresponds a C® mapping 


0:1, x V >M, 


satisfying 

(*) A" (t, q) = Xora. o 

and 

(+x) 0”(0,qa)=4 forall qevV. 


If F(t) is an integral curve of X with F(0) = qe V, then F(t) = 0" (t, q) for 
|t| <ô. In particular, this mapping is unique in the sense that if V,, ô, is 
another such pair for pe M, then O% = 0"! on the common part of their 
domains. 
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Proof This is basically a restatement of the existence theorem as fol- 
lows. For pe M we choose a coordinate neighborhood U, p and map X to 
the g-related vector field X = g,(X) on U = ọ(U) c R". After applying the 
local existence theorem to obtain F:1, x VU defined by F(t, a) = 
(x'(t, a), ..., x(t, a)) on a neighborhood V c Uofg(p), weset V = — *( 
and define 0”: I, x V > U by 0”(t, q) = 9 (F(t, ọ(q)). Since and @ 
are diffeomorphisms, we see at once that 0” satisfies (*) and (**). The fina 
assertion is a consequence of the uniqueness of solutions. 


l= 
exw 


m — 


Finally we consider the global aspects of the theory, that is, given a 
vector field X on M, what can be said that goes beyond the description of the 
situation in a neighborhood of a point. Our main purpose is to establish the 
relation between vector fields on M and local one-parameter groups acting 
on M (Theorem 4.6). The first result depends only on (I) of the existence 
theorem. 


(4.3) Theorem Let X be a C™-vector field on a manifold M and suppose 
pe M. Then there is a uniquely determined open interval of R, I(p) = {a(p) < 
t < p(p)} containing t = O and having the properties: 


1) there exists a C*-integral curve F(t) defined on I(p) and such that 
2) given any other integral curve G(t) with G(0) = p, then the interval of 
definition of G is contained in I(p) and F(t) = G(t) on this interval. 


( 
F(0) 
( 


Proof Let F(t) and G(t) be two integral curves such that F(0) = p = 
G(0), and suppose Ir, Ig to be the open intervals on which they are defined, 
I* the set on which they agree. I* is not empty since it contains t = 0 and it 
is closed in Ip © Ig since F(t) and G(t) are continuous. Suppose s € I*, Since 
selpalg, an open set, there is some interval —5<t<6 on which 
F(t) = F(t + s) and G(t) = G(t + s) are both defined. They are both integral 
curves satisfying the same initial condition: F(O) = F(s) = G(s) = G(0). From 
the existence theorem they agree on some open interval |t| < 6 around t = 0. 
Thus F(t) = G(t) on an open set around s and /* is open. It follows that 
I* = 1, ^ Ig. Hence F and G define an integral curve on Ip U Ig. Thus, I(p) is 
the union of the domains of all integral curves which pass through pat t = 0; 
the asserted properties are immediate. Note that it is quite possible for 
a(p) = — œ and/or B(p) = + œ. If both occur, then F(p) = R. 


We shall use the notation F(t) = @(t, p) for the unique maximal integral 
curve F(t) such that F(0) = p. When we wish to emphasize dependence on t, 
or p, we may write 6,(t), or 6,(p), for A(t, p). 
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Now let the subset W c R x M be defined by 
W = {(t,p)eR x M |a(p) < t < B(p)}. 


According to what has been shown thus far both W and @ are uniquely 
determined by X and W is the domain of 8: W > M. Moreover we have the 
following properties of 0 and W: 


(i) {0} x M c W and (0, p) = p for all pe M. 
(ii) For each (fixed) pe M, let 6,(t) = 8(t, p). Then 


6,:1(p) +> M 
is a C”-integral curve, that is, 6,(t) = X9,¢), and is maximal. 


(iii) For each pe M there is a neighborhood V and a ô > 0 such that 
I, x Vc Wand @ is C” on I; x V. 


Using this notation and the same facts that we used in the proof of 
Theorem 4.3, we obtain the following addendum relating I(p) and I(q) for 
any two points p and q of the same integral curve. 


(4.4) Corollary Let seI(p) and q = 0,(s) = O(s, p) be the corresponding 
point of the integral curve determined by p. Then a(q) = alp) — s and B(q) = 
B(p) — s so that 


I(q) = 1(9,(s)) = {a(p) — s < t < B(p) — 5}. 
Thus te I(q) if and only if t + se I(p), and then we have 
Ht, O(s, p)) = A(t + s, p). 


Proof Suppose that seI(p) and let F(t) = 0,(s + t). Then F(t) is 
defined on the open interval (p) < s + t < f(p) and F(0) = 6,(s) = q and is 
an integral curve. By maximality and by uniqueness we have F(t) = 
O(t, 0,(s)) = A(t, q) so its domain must be F(q) = falqa) < t < B(q)}. | 


We take note that what has been proved to this point answers any 
questions we might have about the existence of integral curves of a vector 
field X on a manifold. It does not describe completely the nature of W. We 
do that now and at the same time specify the relation between vector fields 
and one-parameter group action. 


(4.5) Theorem For any C®-vector field X the domain W of 0(t, p) is open in 
R x M and @ is a C” map onto M. 


Proof We must show that for each (t, po) € W there is a neighborhood 
V of po and ô > O such that the open set (t — 6,t + 6) x V is in W and 0 is 
C® on it. This is already known to be the case for (0, po). If the theorem fails, 
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then there must be some (to, Pope W such that for each 0 < t < tọ there 
exists (t — ôt + ô) x V with the above properties, but not for (tg, Po). 
[We have assumed, without loss of generality, that tọ > 0).] We shall show 
by contradiction that there can be no (tg, Po). 

Should there be such, then using Theorem 4.2, we find 6, > 0 and a 
neighborhood V, of go = (to, po) such that I5, x Vo = W and @ is C” on it. 
By continuity of O(t, po) in t we may find t, < tọ with both | to — ti | < 469 
and O(t,, Po)€ Vo. Since t, < tg, by our assumption on (to, po) there is a 
ô, > Oand a neighborhood V, of pg such that (ti — ô, ti +ô) x Vic W 
and such that @ is C® on this open set. In particular, O(t,, po) is in Vo and 
6,,:V, >M is defined and C”, so we may suppose by continuity (and 
restricting V, if necessary) that 0,(V,) = Vo. We now have O(s + t., q) 
defined and C® on the open set |s| < ô, and q € V,; and its values for s = 0 
are in Vg. By Corollary 4.4 for a(6(t,, q)) < s < B(@(t,, q)) the equation 


O(s + tis q) = o(s, A(t), q)) 


is valid. Since 6(t,, q) is in Vy, by the definition of 6, and V, the interval 
1(0(t,, q)) contains all s for which |s| < ĉo. Thus 8(s + t,, q) is defined and 
C® for |s| < ĉo and any qe V,. However, this is an open set containing 
(to, Po) since |t) — tı | < 469. This shows that our assumption on (to, Po) 
leads to a contradiction. | 


We now note that (4.3)-(4.5) tell us exactly that 0: W — M defines a local 
one-parameter group action on M (Definition 3.11). It arises from X via the 
existence theorem and has X as infinitesimal generator. In fact it tells us 
somewhat more: Suppose 0, W — M and 6,: W, > M are local one-param- 
eter group actions. We will write 0, = 0, on W, ^ W,. Collecting our results, 
we have the following, which is equivalent to the existence theorem in the 
autonomous case: 


(4.6) Theorem In the notation above, 0, = 0, if and only if they have the 
same infinitesimal generator X. Further, every C® vector field X is the 
infinitesimal generator of a unique local one-parameter group action 6 on M 
whose domain W is maximal among all Ñ = 0. 


(4.7) Remark It is important to note that now Theorems (3.12) and (3.14) 
hold for the 6(t, p) defined on W by applying the existence theorem. Thus, 0, is 
a diffeomorphism of its domain V,= {pe M|tel1(p)}, an open set, onto 
V_,= {pe M|—te I(p)}. Further, if X, # 0, then there are local coordinates 
U, ọ in which X = E, = 9, '(6/0x') on U. 
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(4.8) Remark A general nth order ordinary differential equation in the 
independent variable t and dependent variable x and its derivatives is given 
by a relation 


We suppose that this is a function of class C” defined on some neighborhood 
in R”+? of the point (0, ap, a), 42, ...,4,) and that in a neighborhood U of 
this point we can write it in the form 


d"x | dx d"- z) 


(This can be done if the derivative of F with respect to its last variable is not 
zero at the point.) 
Now let x = x!, dx/dt = x°, ..., d” !x/dt"7* = x" and consider the first- 
order system of ordinary differential equations 
dx’ dx? dx" 
(x) —--= x72, =x), 1... = G(t,x!, x7,...,2774), 


dt 
with initial conditions 
(*x) xXO)=a',  i=1,...,n. 


The original nth order equation has a solution x(t) satisfying initial condi- 
tions (at t = 0): 


dx d""'x 
x(0) = a’, (3), Po oaa (To), a 


if and only if the first-order system (*) has a solution satisfying (**). Hence 
the existence theorem (Theorem 4.1) gives the existence and uniqueness of 
solutions of ordinary differential equations of nth order. This can be ex- 
tended also to systems of ordinary differential equations of higher order than 
one. The conclusions of Theorem 4.1 concerning uniqueness of solutions 
and differentiability of dependence on initial conditions are also valid in this 
more general situation. 

A second generalization is the case in which the functions f* of system (+) 
of Theorem 4.1 depend on parameters z',..., z” so that the system becomes 
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If we assume that the functions f‘ are of class C” in the z’s also on some open 
set U’ c R", that is, fi is a function of class C” on I, x U x U' c Rx 
R” x R", then the solutions will depend on the z’s as well as on the initial 
conditions: 


ee a (on eee a", Lee Z”). 


It is a further consequence of Theorem 4.1 that these functions are of class C” 
in all variables on an open set I, x V x V’ co R x R" x R”. This is very 
easily proved by introducing new equations of the form dzi/dt =0,j = 1,..., 
m, so that we are dealing with a system of n + m ordinary equations to 
which we apply Theorem 4.1. 


An application will be made of this idea in the following case. Choose a 
basis E;,..., E, of the tangent space at the identity e of a Lie group G and let 
X,(z',..., 2") denote the value at ge G of the uniquely determined left- 
invariant vector field X whose value X, at e has components zi, ..., z", that 
is, X, = )7-, 2'E;. With the choice of basis fixed, the left-invariant vector 
fields on G are thus parametrized by R”. The dependence on g and on the 
parameters is C” so that the solutions of the system of equations corre- 
sponding to each of the vector fields X (zi, ..., 2") is C” in all variables. Thus 
we have 0(t;g;z',..., 2"), which gives a C® mapping 0: R x G x R" > G 
and, for g, z fixed, determines the integral curve through g. 


Exercises 


1. Consider a system of n ordinary differential equations of second order in 
n unknown functions: 
d?xk 


k 1 
“dt? =f (ex pita Xe 


State as precisely as you can an existence theorem for solutions and 
derive it from Theorem 4..1. 

2. Give a detailed statement and proof of the two generalizations indicated 
in Remark 4.8. 

3. Let M = R?, the xy-plane, and X = y(0/dx) — x(d/dy). Find the domain 
W and the one-parameter group 0: W > M. 

4. Let X and Y be vector fields on manifolds M and N, respectively, and 
F: M > Na C” mapping. Show that X and Y are F-related if and only 
if the local one-parameter groups 0 and a generated by X and Y satisfy 
F o 0,(p) = c, ° F(p) for all (t, p) for which both sides are defined. 

5. Give a precise meaning to the following statement and then prove it: If 
the vector field X on M generates the local one-parameter group 0 
acting on M, X is invariant under the action. 
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6. Show that the orbits of a local one-parameter group may be defined in 
terms of an equivalence relation, just as in the case of a group G acting 
on a manifold. Show by example that the orbit space may fail to be 
Hausdorff. 

7. Show that the general (nonautonomous) system on R” of Theorem 4.1 
can be reduced to the autonomous case on R"*! by letting t = x"*! and 
adding an equation dx"*'/dt = 1 with x"*'(0) = 0. 

8. Derive Theorem 4.1 from Theorem 4.6. 


5 Some Examples of One-Parameter Groups Acting on a Manifold 


We shall now consider a local one-parameter group 0 with (maximal) 
domain W and infinitesimal generator X acting on a manifold M. For pe M, 
we continue to denote by I(p) the set «(p) < t < fi(p) of all real numbers t 
such that (t, p) is in W. The integral curve of X through p is given by 
6,: 1(p) > M, 0,(t) = O(t, p). If X, = 0, the curve is a single point p; other- 
wise 0, is an immersion as was shown earlier. In this latter case we consider 
now the nature of the integral curves on M. 


(5.1) Lemma Suppose that B(p) < œ and that {t,} < I(p) is an increasing 
sequence converging to B(p). Then {0(t,, p)} cannot lie in any compact set. In 
particular, the sequence {@(t,, p)} cannot approach a limit on M. A similar 
statement holds for a decreasing sequence approaching a(p) if a(p) is finite. 


Proof Let K be a compact subset of M and X a C*%-vector field on M. 
By the existence theorem to each q € M corresponds a ô > 0 and a neighbor- 
hood V of q such that @ is defined on I, x V. A finite number of such 
neighborhoods cover K and we let 69 be the minimum 6 for these neighbor- 
hoods. Then for each qe K, O(t,q) is defined for |t| < ĉo. Suppose 
{O(t,, P)} = K and that N is so large that B(p) — ty < 469. Then, by Corollary 
4.4, O(t, (ty, p)) is defined only for t< B(p) — ty < 6,/3. But this is a 
contradiction since (ty, p)€ K, so A(t, O(ty, p)) is defined at least for — ôo < 
t < ôo. This proves the first statement. The second is an immediate conse- 
quence for if lim,.,,6(t,, p) = q, then there is a neighborhood of q whose 
closure K is compact and contains all but a finite number of terms of the 
sequence {O(t,, p)}. We discard the terms not in K and obtain the same 
contradiction. Obviously the same arguments apply to decreasing sequences 
approaching a(p)—if «(p) is finite. 


(5.2) Corollary If I(p) is a bounded interval, then the integral curve is a 
closed subset of M. 


PDF pdfFactory Pro www.fineprint.cn 


140 IV VECTOR FIELDS ON A MANIFOLD 


(5.3) Corollary If X, = 0, then I(p) = R. If X = 0 outside a compact subset 
of M, thn W =R x M. 


We leave the proofs of these two corollaries to the exercises. 


(5.4) Remark A point p of M at which X, = 0 is called a singular point of 
the vector field and any other point is referred to as regular. We have seen 
(Theorem 3.14) that in the neighborhood of a regular point the integral 
curves are—to within diffeomorphism—the family of parallel lines 
x? = c?,..., X" = c" in R". On the other hand the pattern of integral curves 
at an isolated singularity can take many forms, even in the two-dimensional 
case, and has been extensively studied. At least in the two-dimensional case 
singularities can be visualized in terms of the integral curves of the vector 
field X near p. Some possibilities are shown in Fig. IV.8a-d. The cases (a) 
and (b) correspond to the field 
of ô of a 


ye we A 
grad f ôx 0x 7 dy dy 


(a) Extremum of f(x, y) {b} Saddle point of f(x, y) 


(NAD 
KAY 


(c) {d} 
Figure IV.8 
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at a point p = (Xo, yo) which is a simple extremum or saddle point of the 
function f(x, y). The level curves f(x, y) = constant are the dotted lines 
orthogonal to the integral curves. If p is a singularity of a general vector field, 
the pattern can be more complicated; possibilities are shown in (c) and (d). 
Interesting relations between the topological nature of the surface and the 
possible types of singularities possessed by a vector field on it were discovered 
by Poincaré, Hopf, and others (see Milnor [2] and Arnold [1]). These 
relations depend on the concept of the index i, of a vector field X at an 
isolated singularity p. In the two-dimensional case this is an integer defined as 
follows: Let C be a simple closed curve containing p in its interior—and no 
other singular point. As we traverse C counterclockwise, the vector field X,, 
qe cC, rotates, i.e., its angle 0 with a fixed direction changes by an integral 
multiple 2zi, of 27. (In Fig. IV.8a,b,c, the index of X is +1, —1, 2, 
respectively.) The results alluded to show that i, does not depend on the C 
chosen and amazingly that if X on a compact orientable surface M has only 
isolated singularities, then the sum of the indices is the Euler characteristic y. 
In particular, it is independent of X. A consequence of these relations is the 
fact already mentioned that a vector field on S?—in fact on any closed 
orientable surface except T?—must have at least one singular point. 


Another important question about a vector field X on M is whether or 
not it has closed integral curves—diffeomorphic to the circle S! (see 
Exercise 3). This can be of importance, for instance, in applications to dyna- 
mics. In these applications one considers the points of a manifold as corre- 
sponding to, or parametrizing, the states of a dynamical system. For 
example, if the system consists of the earth, sun, and moon, then in a fixed 
coordinate system the positions of the three objects can be characterized by 
nine numbers (three sets of coordinates) and their velocities, or momenta, by 
nine more (the components of three vectors). Thus each state or 
configuration corresponds to a point on a manifold M of dimension 18. The 
laws of motion can be expressed as a system of ordinary differential equa- 
tions or vector field X on M, and the integral curves correspond to the 
motions beginning from various initial states. A closed integral curve corre- 
sponds to a periodic motion, like that of the planets. This approach to 
mechanics was extensively studied by Poincaré and Birkhoff, and is still an 
active area of research (see Smale [2]). It has led to many interesting 
questions about vector fields and curves on manifolds. For example, it was 
very recently shown by Schweitzer [1], that there exist everywhere regular 
vector fields on S? without any closed integral curves—contrary to a long 
standing conjecture. Classical mechanics in the framework of manifold 
theory is very clearly set forth by Godbillon [1]. Other excellent books on 
differential equations and dynamical systems are Hirsch and Smale [1], 
Arnold [2], and Abraham and Marsden [1]. 
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(5.5) Definition A vector field X on M is said to be complete if it generates 
a (global) action of R on M, that is, if W = R x M. 


This is clearly the most desirable case and we find it very convenient to 
have sufficient conditions for completeness. One of them is an immediate 
corollary of Lemma 5.1. 


(5.6) Corollary If M is a compact manifold, then every vector field X onM 
is complete. 


To see that this is so we take K = M in the lemma and note that in this 
case a(p) = — œ and f(p) = +œ, that is, I(p) = R, for every pe M. 

This gives one important case in which we may be sure that a vector field 
is complete. A second case, which we will study in some detail, is a left- 
invariant vector field on a Lie group, as is shown by the corollary to the 
theorem which follows. 


(5.7) Theorem Let X be a C®-vector field on a manifold M and F: M > M 
a diffeomorphism. Let 0(t, p) denote the C” map 0: W > M defined by X. 
Then X is invariant under F if and only if F(0(t, p)) = 0(t, F(p)) whenever 
both sides are defined. 


Proof Suppose that X is invariant under F. If 0,: I(p) > M is the inte- 
gral curve of X with 0,(0) = p, then the diffeomorphism F takes it to an 
integral curve F(0,(t)) of the vector field F,(X). Since F(X) = X and 
F(6,(0)) = F(p), from uniqueness of integral curves we conclude that 
F(0,(t)) = 0(t, F(p)). This proves the “only if” part of the theorem. 

Now suppose that F(0(t, p)) = 0(t, F(p)) and prove that 
F,(X,) = Xr. This could be done directly from expression (3.2) for the 
infinitesimal generator X, but we shall proceed in a slightly different way. 
Let 0,(t) = O(t, p) and let d/dt be the natural basis of To(R), the tangent space 
to R at t = 0. Then, by definition, X, = 0,(0) = 0,,(d/dt) and applying the 
isomorphism F,: T,(M) > Trq(M) to this definition we have 

F(X ) = Fx ° 0 px(d/dt) = (Fo 0 ,)4(4/dt) = Or (pyx(4/dt) = Xr: 


P 


The second equality is the chain rule for the composition of mappings 
applied to 0,: R > M and F: M > M. The third equality uses the hypothesis 
that F - 0 (t) = Orp (t). i 


We remark that in the notation of Section 3 this theorem could be 
stated: F,(X) = X if and only if 0,9 F = Fo 0, on V,. 


(5.8) Corollary A left-invariant vector field on a Lie group G is complete. 
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Proof Let X be such a vector field. There is a neighborhood V of e and 
a ô > Osuch that 6(t, g) is defined on 7; x V. For he G, let L, denote the left 
translation by h. If we apply Theorem 5.7 with F = L,, then 0(t, L,g) = 
L, O(t, g), which shows that 0 is defined on J, x L,(V), a neighborhood of 
(0, h) in R x G. It follows that for every he G there is a neighborhood 
U = L,(V) such that 7; x U = W, the domain of 6 with the same 6 > 0 that 
we first obtained for V, that is, 6 is fixed and independent of h. By the same 
argument as in the compact case we obtain a contradiction if we assume for 
any ge M that either a(g) or B(g) is finite. Therefore W = R x M and X is 
complete. | 


We shall now carry our analysis of the Lie group situation somewhat 
further and in this way will obtain a number of examples of actions of R on 
manifolds. 


(5.9) Definition Let R be the additive group of real numbers, considered 
as a Lie group, and let G be an arbitrary Lie group. A one-parameter sub- 
group H of G is the homomorphic image H = F(R) of a homomorphism 
F: R > G. It is called trivial if H = {e}. 


We give here several simple examples of one-parameter subgroups. In 
the next section it will be shown how all such subgroups may be determined 
for linear Lie groups, that is, subgroups of Gl(n, R). Since we are interested in 
the action of R on manifolds, we recall at this point a comment and 
examples of Section III.7. Namely, let G be a Lie group which acts on a 
manifold M by 6:G x M >M and let F: R >G be a homomorphism. 
Then 0: R x M > M defined by 0(t, p) = (F(t), p) defines an action of R 
on M. Now applying Sections 3 and 4 we have an associated infinitesimal 
generator X, integral curves as orbits of the action, and so on. Since the same 
G may act on different manifolds, or in different ways on the same manifold, 
a fixed one-parameter subgroup of G will give many examples of a one- 
parameter group of transformations of a manifold. 


(5.10) Example Let G be the group Gi(3, R). We consider two one- 
parameter subgroups, that is, two homomorphisms F,, F, of R into G, 
defined as follows (a, b, ce R are constants): 


vw 0 0 1 at bt + 4act? 
F,()=] 0 œ O and F(t)=ļ0 1 ct 
0 O e@ 0 0 1 


It is left as an exercise to check that these are homomorphisms. Now 
GI(3, R) acts naturally on R°? (Example HI.7.4) and hence each F; defines an 
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action on R?. In the case of F, we have 0(t, x!, x?, x°) = (e"'x!, etx’, e"x>). 
Therefore the infinitesimal generator X is given at xe R° by 
. ĉ ) ĉ 
X, = o(0, x) = ax} axl + ax? 3 + ax? 3) 
and the integral curves, or orbits, are the lines through the origin (see 
Fig. IV.9). 

The group Gi(n, R) also acts on P*~*(R), since it preserves the equi- 
valence relation (proportionality) of n-tuples which defines it. Therefore 
GI(3, R) acts on two-dimensional projective space P?(R). In this case F, 
defines a trivial action 6(t, p) = p- 


(5.11) Example Let G be the Lie group SO(3) of orthogonal matrices with 
determinant +1. Define F: R > SO(3) and thus a one-parameter subgroup 


by 
cosat sinat O0 
F(t)= | —sinat cosat 0 
0 0 1 


Again, it is easily checked directly that this is in fact a homomorphism. Thus 
SO(3) acts on the unit sphere S? in a standard manner which we previously 
discussed (Section III.7 and Exercises). The action is just the usual rotation 
of the sphere, and F defines a one-parameter group of rotations holding the 
x? axis fixed: 


a(t, x!, x?, x?) = (x! cos at + x? sin at, —x! sin at + x? cos at, x°). 


xe 


Figure IV.9 
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Figure IV.10 


The orbits are the lines of latitude and the generator X is tangent to them 
and orthogonal to the x-axis. X =0 at the north and south poles 
(0, 0, +1). (See Fig. IV.10.) 


(5.12) Example We recall also that a Lie group G acts on itself (on the 
right) by right translations. Thus if we are given a homomorphism 
F: R > G, we may define an action 0 of R on M = G by O(t, g) = Rra(g) = 
gF(t). We have used R, to denote right translation: R,(g) = ga. As 
previously noted in Section III.7, this is a composition of C® maps, F, and 
right translation. It is an action since F is a homomorphism and multiplica- 
tion is associative: 


(i) 9(0, g) = gF(0) = g, 
(ii) Olt + s, g) = gF(t + s) = g(F(t)F(s)) 
= (gF(t))F(s) = (t, O(s, g)). 


Thus the examples above furnish further examples of one-parameter group 
action, namely, on M = Gi(3, R) and M = O(3), respectively. 


Recalling that a left-invariant vector field on G is uniquely determined by 
its value at the identity e, we may use these ideas to characterize one- 
parameter subgroups of a Lie group. 


(5.13) Theorem Let F:R >G be a one-parameter subgroup of the Lie 
group G and X the left-invariant vector field on G defined by X, = F(0). Then 
O(t, g) = Req(g) defines an action 0:R x G >G of R on G (as a manifold) 
having X as infinitesimal generator. Conversely, let X be a left-invariant vector 
field and 0: R x G—G the corresponding action. Then F(t) = O(t, e) is a 
one-parameter subgroup of G and O(t, g) = Rra lg). 
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Proof Given the C” homomorphism F: R > G, then 6: R x G > G, 
defined by 0(t, g) = Rrw(g) = gF(t) is, as we have just seen, an action of R 
on G. If aeG, then L,6(t, g) = a(gF(t)) = (ag)F(t) = 0(t, L.(g)). By 
Theorem 5.7 it follows that the generator X of 0 is L,-invariant, for any 
ac G. However, 0(t, e) = F(t), and so X, = 6(0, e) = F(0), which proves the 
first half of the theorem. 

For the converse X, being left-invariant, is both C® and complete and it 
generates an action 0 of R on G. By Theorem 5.7 for any left translation L, 
we have L,O(t,g) = O(t, L,(g)) or equivalently, hO(t, g) = 0(t, hg). Let 
F(t) = 0(t, e) and h = F(s). Then this relation implies 


F(s)F(t) = F(s)0(t, e) = 0(t, O(s, e)) = A(t + s, e) = F(s + t). 


Thus t > F(t) is a C® homomorphism. But F(0) = 6(0, e) = X, and since X 
is left-invariant, we see by uniqueness of the action generated by X that 
O(t, g) = Rro lg), the action defined just previously. | 


(5.14) Corollary There is a one-to-one correspondence between the ele- 
ments of T.(G) and one-parameter subgroups of G. For ZeT,(G) let 
t > F(t, Z) denote the (unique) corresponding one-parameter subgroup. Then 
F:R x T.(G) > G is C® and satisfies F(t, sZ) = F(st, Z). 


Proof According to Theorem 5.13, each Z € T.(G) determines a unique 
homomorphism t > F(t, Z) of R into G such that F(0, Z) = Z. By our exten- 
sion of the existence theorem at the end of Remark 4.7 we see that F is C” 
simultaneously in t and Z [identifying T,(G) with R” by some choice of 
basis]. Using the rule for change of parameter in a curve ona manifold, we 
have 


t=0 


f F(ts, al = sli F(t, 2) = sZ. 


On the other hand t > F(ts, Z) is a homomorphism. Therefore, by uni- 


queness, F(st, Z) = F(t, sZ). | 
Exercises 

1. Prove Corollary 5.2. 

2. Prove Corollary 5.3. 


3. Let X be a vector field on M and let F: I(p) > M be the integral curve 
determined by F(0) = p. Suppose for some real number c > 0, F(c) = 
F(0). Show that this implies I(p) = R and F(t) = F(t + c) for all te R. If 
X, #0, then prove that there is a diffeomorphism G: S! > M and a 
number cy, O<co<c, such that F=Gon, with m:R- 
S! = {zeC| |z| = 1} denoting the mapping x(t) = ge, 

4. Given pe M, show that if I(p) is bounded for a C®-vector field X on M, 
then t > b(t, p) is an imbedding of I(p) in M. 
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5. Given pe M and a C%-vector field X on M, let {t,} be a monotone 
increasing (decreasing) sequence of I(p) which has no limit on (p). Show 
thatiflim,.,, O(t,, p) exists, then a(p) = — œ (B(p) = + œ, respectively). 
Let L*(p) [or L’(p)] denote the collection of all such limit points 
for increasing (decreasing) sequences. Show that L*(0(t, p)) = L*(p) 
for every t € I(p) and that L*(p) is a closed set and a union of integral 
curves. 

6. Show that a one-parameter subgroup H of a Lie group G which is not 
trivial is either an isomorphic image of S' or R; and in the latter case is a 
submanifold whose closure is an (algebraic) Abelian subgroup, possibly 
larger than H. Give examples of each case. 


6 One-Parameter Subgroups of Lie Groups 


We have seen that one-parameter subgroups of a Lie group G are in 
one-to-one correspondence with the elements of T,(G). We shall use this to 
help determine all one-parameter subgroups of various matrix groups. We 
first consider G = Gl(n, R). The matrix entries x;;, 1 < i,j <n, for any 
X = (x;;)€ Gl(n, R) are coordinates on a single neighborhood covering the 
group, which is an open subset of .@,(R), the n x n matrices over R. There- 
fore @/éx,;, 1 <i, j <n, is a field of frames on G and, relative to these 
frames as a basis at e, there is an isomorphism of .@,(R) as a vector space 
onto T,(G) given by A = (a;;) > J; j 4;(0/0x;;)e. [When G = Gl(n, R), e is the 
n x n identity matrix 1.] 


(6.1) Definition The exponential e* of a matrix X € M,(R) is defined to be 
the matrix given by 
1 


1 
(*) CLK tek eR 


if the series converges. 


(6.2) Theorem Series (+) converges absolutely for all X € M,(R) and uni- 
formly on compact subsets. The mapping “,(R) > ,(R) defined by X > e* 
is C” and has nonsingular Jacobian at X = 0. Its image lies in Gl(n, R). If 
A, Be M"(R) such that AB = BA, then e4*® = e4e®. 


Proof If we denote by x the entries of the matrix X* with 
X' = X = (x;;) and X° = J = (6,,), and we let p = sup,<; jen | Xi], then 
by induction on k we have the inequality 


|x | < (np). 
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This is true for k = 0, and if it holds for k, then 


‘a 1 


xfer?) = |Z xP] < < n(np)*p = (np 


From this it follows that the sequence e* converges absolutely for every X 
and that it converges uniformly on every compact subset of .@,(R); indeed 
each compact set is contained in a set K, = {X||x;;| < p}. Using absolute 
ee each entry of e* may be written as a power series in any of the 

;j converging on all of R (or even C). Hence X > e* is analytic in each x,, and 
is si ae s an analytic mapping (by Hartog’s theorem). 

If we denote by f(X) the coordinate functions of the mapping, then the 
terms of degree less than 2 in the variables x,, are 


SAX) = ôy + xij, 1<ij<n; 


hence the Jacobian matrix at X = 0 reduces to the n? x n? identity matrix. 

Finally, using the fact that the convergence is absolute, so that we may 
rearrange terms, and an analog of Cauchy’s theorem for multiplication of 
series, when AB = BA we obtain the equality 


(Sa4)(S 52) - 2 3 : amr’ B= (A+ BY". 


kao kK! I! woo pzo (m — p)! p! m M 


From this we may deduce efe? = e4*® (see Exercise 1). In particular, this 
implies e4e~ 4 = e? = I so that e^ is nonsingular, that is, ef € Gi(n, R) for 
any a€ M ,(R). This completes the proof of the theorem. | 


(6.3) Corollary F(t) = e^ is the one-parameter subgroup of Gl(n, R) whose 
corresponding left-invariant vector field has the value Yi, j 4ij(0/0%:; pe All 
one-parameter subgroups of Gl(n, R) are of this form. Further, F (0) = 


Proof The corollary is an immediate consequence of the theorem. For 
every te R, t, A and t, A commute, thus e1*!)4 = ef 4e"4 and t > e'f isa 
group homomorphism: it is C” since it is a restriction of a C”-map on 
M, (R) to the submanifold {tA | t € R}. Finally, writing x;,(t) for the ijth entry 


of e'4 and letting A = (a;;), we have 
x(t) = ôy + ta; + O(t?) 


so that x,(0) =a, 1 <i, j <n or, equivalently (de'4/dt),-9 = A. This 
proves the corollary. 
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(6.4) Example 


b 
c | €E M3(R) 
0 


a 
Il 
ooo 
oo;. 


implies 
e4=1+4+tAt+ht?A? +. 
However, A* = 0 if k > 2 so that we obtain once again Example 5.10: 


1 ta tbh + 4t?ac 
e4=[0 i tc 
0 0 1 


By virtue of the following theorem, we can use the mapping X — e* to 
determine the one-parameter subgroups of other matrix groups, for exam- 
ple, O(n), Sl(n, R) (see Exercise 6), and so on. 


(6.5) Theorem If H is a Lie subgroup of G, then the one-parameter sub- 
groups of H are exactly those one-parameter subgroups t > F(t) of G such that 
F(0)e T.(H) considered as a subspace of T,(G). 


Proof Let F:R— H be any one-parameter subgroup of H. Since 
H c G and the inclusion is an immersion, in particular is C”, the map F 
followed by inclusion is a one-parameter subgroup of G. Its tangent vector at 
any point is tangent to H. In particular, F(0)e T,(H) a subspace of T,(G). 
Conversely, if F: R + G is a one-parameter subgroup such that F(0) € T.(H), 
then F(0) determines a one-parameter subgroup of H, F,: R > H, with 
F,(0) = F(O). As we have just seen, F, can be considered a one-parameter 
subgroup of G, but since F and F, have the same tangent vector at e, they 
must agree. Therefore the correspondence is one-to-one as claimed, which 
completes the proof. | 


Suppose that G = Gl(n, R) in the discussion above, then we have the 
following application. 


(6.6) Corollary The one-parameter subgroups of a subgroup H c Gl(n, R) 
are all of the form F(t) = e'4, where A = (a,;) are the components of the vector 
F(0) = Yi j Uf 0/OXije € TAG) which is tangent to H at e, i.e., is in TAH) c 
TG). 


This is an immediate consequence of the theorem and the fact that every 
one-parameter subgroup of G = Gl(n, R) is of the form F(t) = e4 
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(6.7) Example Let H = O(n), G = Gl(n, R), and determine the one- 
parameter subgroups of H. If e'4^ € H for all t, then (e'4)(e'4)’ = I, where the 
prime indicates the transpose. It is an immediate consequence of 
Definition 6.1 that (e'4)’ = e'*’; and, by Theorem 6.2, (e'4)"? = e ‘4. From 
these facts we conclude that e'4 € H implies e^ = e~ ‘4. Taking derivatives on 
both sides at t = 0, we see that A’ = — A. Conversely, if A’ = — A, then 
et4(el4y = e'4e'4’ = e'4e~'4 = J, which means that e4 is an orthogonal ma- 
trix. Let o(n) denote the linear subspace of skew symmetric matrices in 
M,(R); it has dimension n(n — 1)/2. Since X > e* has nonsingular Jacobian 
at 0 €.%,(R), it is a diffeomorphism near O. The same will then be true of its 
restriction to o(n): A neighborhood of O € o(n) maps diffeomorphically onto 
a neighborhood of J e O(n). This proves the following: 


The homomorphism t > e'4 is a one-parameter subgroup of O(n) if and only 
if A' = — A, which is the necessary and sufficient condition on A = (a;;) in 
order that the tangent vector Y; ; a;;(6/0x;;), to Gl(n, R) at the identity be 
tangent to the subgroup O(n). The dimension of O(n) is n(n — 1)/2. 


Finally we recall that if G is a Lie group and Z e T,(G), then Z determines 
uniquely a one-parameter subgroup which we denoted earlier by F(t, Z). We 
use this to define an exponential mapping on an arbitrary Lie group. 


(6.8) Definition The exponential mapping, exp: T,(G) > G, is defined by 
the formula exp Z = F(1, Z). 


According to Theorem 5.13 we have the following properties: 


(6.9) Theorem For any Lie group G the mapping exp: T,(G) > G is C* and 
F(t) = exp tZ is the unique one-parameter subgroup such that F(0) = Z. The 
Jacobian matrix at 0 of exp is the identity, that is, at e, exp, is the identity.t 
Finally, if G is a subgroup of Gl(n, R), then for each Z € T,(G) there is an 
A = (a;;)€-@,(R) such that Z = } a,,(0/0x;;),, and for this Z, exp tZ = e!4. 


Exercises 


1. Complete the proof that when A, B are commuting n x n matrices, then 
e^*B — e4e* (Hint: first prove this for the exponential function on R, 
using multiplication of power series; then try a similar proof.] 

2. Check directly that the mapping t > e'4 of Example 6.4 is a group 
homomorphism. 


+ This requires that we identify the tangent space to 7,(G) at Z = 0 with T,(G) itself, a 
common practice when working with vector spaces; exp, is the mapping of tangent spaces 
induced by exp. 
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3. Let A = (a,;) be ann x n matrix such that a;; = O ifj < i. Prove that the 
one-parameter subgroup e’ is not a circle group in Gi(n, R). 
4. Find the one-parameter subgroups of G/(2, R) corresponding to A and B 


with 
0 1 0 1l 
a(i o} P= o) 
Find the corresponding actions on R? and their infinitesimal generators, 
starting from the natural action of Gi(2, R) on R?. 

5. Show that for any Lie group G the mapping u:G —> G defined by 
u(g) =g~' takes left invariant vector fields to right invariant vector 
fields. 

6. Prove that if A is a nonsingular n x n matrix and X €.@,(R), then 
Ae*A~! = e4*4-'. From this deduce that det e* = e*. Use this to 
determine those matrices A such that e'4 is a one-parameter subgroup of 
Si(n, R). 

7. Using the coordinate frames 6/¢x,;, i < i,j < n on Gl{n, R), show that 
the vector field Z on Gi(n, R) whose matrix of components at the iden- 
tity is A =(a,,) and XA at the element X = (x;;) of Gl(n, R) is a left- 
invariant vector field. 


7 The Lie Algebra of Vector Fields on a Manifold 


We denote by ¥ (M) the set of all C%-vector fields defined on the C% 
manifold M. It is itself a vector space over R since if X and Y are C*-vector 
fields on M so is any linear Combination of them with constant coefficients. 
In fact any linear combination with coefficients which are C” functions on 
M is again a C®-vector field. For X, Y e ¥(M) and f, ge C*(M) implies that 
the vector field Z = fX + gY, with the obvious definition Z, = f(p)X, + 
g(p)Y, for each pe M is a C®-vector field. We may express this as follows: 


¥(M) is a vector space over R and a module over C”(M). 
As a vector space ¥(M) is not finite-dimensional over R (Exercise 1). In fact 


X(M) is something more than just a vector space as we shall see. 


(7.1) Definition We shall say that a vector space ¥ over R is a (real) Lie 
algebra if in addition to its vector space structure it possesses a product, that 
is, a map Z x L > fF, taking the pair (X, Y) to the element [X, Y] of Y, 
which has the following properties: 


(1) it is bilinear over R: 
[xX + 82X2, Y] = a,[X1, Y] + o[X2, Y], 
[X, a; Y, + a, ¥,] = «,[X, ¥,] + fX, Yo]; 
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(2) it is skew commutative: 
x, Y] = -[X, Y]; 
(3) it satisfies the Jacobi identity: 
[X, [Y, Z]] + [Y, [Z, X]] + [Z, [¥, Y]] = 0. 


(7.2) Example A vector space V°, of dimension 3 over R with the usual 
vector product of vector calculus is a Lie algebra. 


(7.3) Example Let .@,(R) denote the algebra of n x n matrices over R 
with XY denoting the usual matrix product of X and Y. Then [X, Y] = 
XY — YX, the “commutator” of X and Y, defines a Lie algebra structure 
on .4,(R) as is easily verified. 


Now suppose that X and Y denote C®-vector fields on a manifold M, 
that is, X, Y e ¥(M). Then, in general, the operator f > X,(Yf) defined on 
C*(p}-f being a C” function on a neighborhood of p—does not define a 
vector at p. Thus XY, considered as an operator on C” functions on M, does 
not in general determine a C®-vector field. However, oddly enough, 
XY — YX does; it defines a vector field ZeX(M) according to the 
prescription 


Z f= (XY — YX), f= X,(Yf) — ¥,(Xf), fe C*(p). 


For if fe C”(p), then Xf and Yf are C” on a neighborhood of p, and this 
prescription determines a linear map of C®(p) > R. Therefore if the Leibniz 
rule holds for Z,, then Z, is an element of T,(M) at each pe M. Consider 
f, g e C”(p). Then f, g e C*(U) for some open set U containing p. Using the 
notation (Xf), for X,f, the value of Xf at p, we have the relations: 


(XY — YX) (fa) = X,(¥fg) — Y,(Xfa) 

= X,(f¥g + gYf) — Y,(fXg + gXf) 

= (X,f)Y9)p + f(P)X(¥9) + (XpgVP)p 
+ g(p)X AYS) — (Y, S XX9) — F(p)¥,(X9) 
— (Y IXS) — gp ¥,Xf), 

so that 
Z,(f9) = (XY — YX),( fg) =f (P\(XY — YX),g + glp(XY — YX), f 
= f (p)Z,9 + g(p)Z, f. 


Finally, if f is C” on any open set U c M, then so is (XY — YX)f, and 
therefore Z is a C”-vector field on M as claimed. 
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We may define a product on ¥ (M) using this fact; namely, define the 
product of X and Y by [X, Y] = XY — YX. 


(7.4) Theorem X(M) with the product [X, Y] is a Lie algebra. 


Proof fa, Be Rand X,, X2, Y are C%-vector fields, then it is straight- 
forward to verify that 


[aX, + 8X2, Y]f = AX, Y]f+ PX, YI: 


Thus [X, Y] is linear in the first variable. Since the skew commutativity 
[X, Y] = —[Y, X] is immediate from the definition, we see that linearity in 
the first variable implies linearity in the second. Therefore [X, Y] is bilinear 
and skew-commutative. There remains the Jacobi identity which follows 
immediately if we evaluate [X, [Y, Z]] + [YX [Z, X]] + [Z, [X, Y]] applied 
to a C*-function f. Using the definition, we obtain 


[X, [Y, Z]] f= X((LY. ZDS) — LY, ZX) 
= X(¥(Zf)) — X(Z(¥f)) — ¥(Z(Xf)) + Z(Y (Xf) 
Permuting cyclically and adding establishes the identity. 


(7.5) Remark [X,Y] is not C%(M) bilinear. In fact for fe C*(M), 
[X, fY] =f[X, Y] + (Xf)Y as is shown in Exercise 2. This may be used to 
derive a formula for the components of [X,Y] in local coordinates 
(Exercise 3). 


We now make use of a vector field X on M to define a method of 
differentiation which has many applications in manifold theory. We have 
already defined the derivative of a function fe C*(M) at a point p in the 
direction of X; it is just X,f. This generalizes from R” to an arbitrary 
manifold the notion of directional derivative of a function. However, if we 
wish to determine the rate of change of a vector field Y at pe M in some 
direction X,,, we have trouble as soon as we leave R", for it is only in R” that 
we are able to compare the value of Y at p with its value at nearby points, 
which we must do to compute a rate of change. Now, given a vector field X 
on M, there is an associated one-parameter group 0: W —> M generated by 
X. For each te R we know (Theorem 3.12) that 6,: V, > V_, is a diffeomor- 
phism (with inverse @_,) of the open set V,, provided V, is not empty. In 
particular for each pe M there is a neighborhood V and a ô > 0 such that 
V c V, for |t| < ô. The isomorphism 6,,: T,(M) > To,p)(M) and its inverse 
allow us to compare the values of vector fields at these two points. 

Indeed, suppose Y is a second C”-vector field on M. We may use this 
idea to compute for each p the rate of change of Y in the direction of X, that 
is, along the integral curve of the vector field X passing through p. We shall 
denote this rate of change by Ly Y; it is itself a C*-vector field. 
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(7.6) Definition The vector field Ly Y, called the Lie derivative of Y with 
respect to X is defined at each pe M by either of the following limits. 


. 1 
(Ly Y), = lim pB- ix( You. p) = Y,] 


t>0 


= lim E — 914 Yous, p) 
170 

The second definition is obtained from the first by replacing t by —t. We 
interpret the first expression as follows: Apply to Yoo. p) € Tou, »(M) the iso- 
morphism 0_,,, taking Ty, p(M) to T,(M). Then in T,(M) take the differ- 
ence of this vector and Y,, multiply by the scalar 1/t, and pass to the limit as 
t—+0. Equivalently, we may identify Z,(t) = 0~ (You, »)€7,(M) with a 
curve of R"; then (Ly Y), = Zz (0). The fact that the vector field so defined is 
C”? may be verified by writing the formula above in local coordinates 
(Exercise 6). We shall give another characterization of Ly Y which requires a 
modification of Lemma II.4.3, following Kobayashi and Nomizu [1, p. 15]. 


(7.7) Lemma Let X be a C*-vector field on M and 6 be the corresponding 
map of W < R x M onto M. Given pe M and fe C*(U), U an open set 
containing p, we choose ô > 0 and a neighborhood V of p in U such that 
Od, x V) <U. Then there is a C* function g(t.q) defined on I; x V such 
that for qe V and te I; we have 


f(g) = f(g) + tg(t,q) and X,f = (0, q). 


Proof There is a neighborhood V of p and a ô > 0 such that 6,(p) = 
O(t, p) is defined and C* on I; x V and maps J; x V into U according to 
Theorem 4.2. The function r(t, q) = f(9,(q)) — f (q) is C” on I; x V and 
r(0, q) = 0. We denote by 7(t, q) its derivative with respect to t. We define 
g(t, q)—for each fixed q—by the formula 


g(t, q) = | F(ts, q) ds. 
0 


This function is also C~ on I, x V (verified by use of local coordinates and 
properties of the integral). By the fundamental theorem of calculus, 


tg(t.q) = i i(ts, q)t ds = r(t, q) — r(0, 4) = r(t, q). 


Using the definition of r, this becomes 


f4) = f(g) + tglt, q). 
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On the other hand, by the definition (3.2) of the infinitesimal generator of 0, 


g(0, q) = im g(t, q) = lim | 7 w(t, q) = lim | z EFCC) —figl=X,f. I 


We use the lemma to prove the following theorem: 


(7.8) Theorem If X and Y are C”-vector fields on M, then 
LY = [X, Y]. 


Proof For any pe M and any function f which is C” near p, we have for 
some 6 > 0 


1 1 
t LY, = B( Yor —1, pd IS = t [Y,f T Y-t, pf 9 0,)] 


for 0 < |t| < 6. If we can show that as t > 0 the limit of the right-hand side is 
[X, Y], f for all such f, then (L,Y), is defined: (L, Y) f = LX, Y],/f for all 
f; and thus (L, Y), = [X, Y],. So the conclusion of the theorem follows. 
Making use of Lemma 7.7 and setting g,(q) = g(t, q), we are led to consider 
then 


lim 7 CY, — Veen + tgd]. 


Collecting the first two terms and replacing t by —t yields 


lim; [(Yf)(9.(p)) — (Yf Xp) — eh Yount + 


By formula (3.2) with f replaced by Yf and At by t, the first term is X,(Yf). 
Since 0,(p) > p and gq) > X,f ast > 0, Yoip9 > YAX f) and we obtain in 
the limit 


This completes the proof of the theorem; it also shows that Ly Y is C”. Jj 
(7.9) Theorem Let F:N + M be a C” mapping and suppose that X,, X, 
and Y,, Y, are vector fields on N, M, respectively, which are F-related, that is, 


for i = 1,2, F(X = Y,. Then [X,, X2} and [Y,, Y,] are F-related, that is, 
F(X, Xa] = [Fy(X1), F (X2) 


Proof Before proving the theorem we note the following necessary and 
sufficient condition for X on N and Y on M to be F-related: for any g which 
is C” on some open set V c M, 


(*) (Yg) °- F = X(g° F) 
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on F~!(V). This is essentially a restatement of the definition of F-related, for 
if qe F~*(V), then F,(X,)g = X,(g° F) = X(g° F)(q); and Ypg is the 
value of the C® function Yg at F(q), that is, ((Yg) ° F)(q). Thus (+) holds for 
all g if and only if F,(X,) = Yr for all qe M. 

Returning to the proof we consider fe C*(V), V = M, so that Y, f and 
Y, fe C”(V) also. Apply (+), first with g = Y,f and then with g = f giving 
the equalities 


[YO fle F = X (Yf) F) = X [X(f F)]. 


Interchanging the roles of Y,, Y, and X,, X, and subtracting, we obtain 


(hi. Y,|f)° F = [X Xi|(f° F), 


which by (+) is equivalent to [X,, X2] and [¥,, Y,] being F-related. I 
We now define the Lie algebra g of a Lie group G. 


(7.10) Corollary If G is a Lie group, then the left-invariant vector fields on 
G form a Lie algebra g with the product [X, Y] and dim g = dim G. If 
F: G, > G, is a homomorphism of Lie groups, F: 91 > 92 is a homomorphism 
of Lie algebras. 


Proof Let a€ G, and let X and Y be left-invariant vector fields. L, (left 
translation) is a diffeomorphism and Lay X = X, Lag Y = Y. Therefore 
Las[X, Y] = [X, Y] by the theorem, so [X, Y] is L,-invariant for any a. 
Hence the subspace g of left-invariant vector fields is closed with respect to 
[X, Y]. Since each X € g is uniquely determined by X,,, the mapping X > X, 
is an isomorphism of g and T,(G) as vector spaces. The last statement follows 
from Corollary 2.10 and Theorem 7.9. | 


(7.11) Remark If H cG is a Lie subgroup, then Corollary 7.10 implies 
that i,(b) is a subalgebra of g. It consists of the elements of g tangent to H 
and its cosets gH. 


(7.12) Theorem Let X and Y be C” vector fields on a manifold M and let 6, 
c denote the local one-parameter group actions they generate. Then[X, Y] =0 
on M if and only if for each peM there is a 6, > 0 such that o,°6,(p) = 
9, ° ap) for tl, ISl < 5p. 


Proof If pe M, then there exists a neighborhood V of p and ô > 0 such 
that c, © 6,(q) and 0, ° a(q) are defined for q € V and |s], |t| < ô. If o, ° 0q) = 
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0, ° a (q) on this set, then by Theorem (5.7), Y is 6,-invariant on V for any 
fixed small t, i.e., 6,(Y,) = Yot,» It follows that 


ae _ 1 
LX, Yl; = (L,Y), = lim i [0_. You,q) — Y] = lim - [Y, — ¥,] = 0. 
t>0 


t>0 t 


Conversely, suppose [X,Y]=0 on M. Given qeV, we let Z,(t) = 
O- Yoa, p) and show Z,(t) = 0. Let q' = O(t, q); then 


; : 1 
Z) = lim —— LO- atar, gl Yaara) — 9-1(%)] 
Ar>0 At 


. 1 
0- {im A EO -a Yocat, a’) — 1) 
= 0.ACL, Y)y) = 0, 


since L, Y = [X, Y] = 0. Hence Z,{t) is constant for |t| < 6 and qe V which 
means that Y is 0,-invariant, and hence, by Theorem 5.7, 0, ° 6, = o, ° 0, on V. 


Exercises 


1. Show that ¥(M) is infinite-dimensional over R but locally finitely gen- 
erated over C”(M), that is, each pe M has a neighborhood V on which 
there is a finite set of vector fields which generate ¥ (M) as a C”(V) 
module. 

2. Let X, YeX(M) and f, ge C”(M) and prove that 


[/X, 9Y] = folX, Y] + f(Xg)¥ — g(Yf)X. 


3. Suppose U, ọ is a coordinate neighborhood on M, X, Y €e ¥(M), and 
E,,..., E, the coordinate frames, and note that [E;, E,] = 0 on U. If 
X =), a'E; and Y = )’, B/E, on U, then show that 


ex 


Op! Oa! 
[X, Y]=} (e p -f ʻE on U. 
: Ox 
4. Given the vector fields in R° (with coordinates x, y, z) 


x= ð 0 : ô ô z ô 0 ð 
~ Vax * ay? ~ 7 oy az =e p er 
compute the components of the three pairs of products. 
5. Show that (Ly Y), depends on the fact that we use vector fields, that 
is, if X, X’ agree at p but are not the same as vector fields, then (Ly Y), 
may differ from (Ly, Y),. 
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6. Write the expressions defining (Lx Y ), (Definition 7.6) in local coordin- 
ates of a coordinate neighborhood U,@ and show that Ly Y is a 
C*”-vector field on U. 

7. Let G = Gi(n, R) with matrix entries as local coordinates. To each 
Xeg we assign the n x n matrix A = (a,;) of components of X,, 
X. = Yi; 4;,(6/0x:;).. Denote this mapping of g onto M,(R) by u. 
Show that u is an isomorphism to the algebra of Example 7.3, i.e., 


HIX, Y] = a(X)u(Y) — w(Y)u(X). 


8. Show that whenever H is a (Lie) subgroup of the Lie group G, then the 
Lie algebra of H may be naturally identified with a subalgebra of g, 
thus verifying Remark 7.11. 

9. Show that two one-parameter subgroups F(t) and G(t) commute 
elementwise if and only if their Lie algebras, in the sense of Exercise 8, 
satisfy [X,¥]=0 for each X,Y in the algebra of F(t), G(¢), 
respectively. 

10. If F: M >N isa diffeomorphism of M onto N and X, Y are C” -vector 
fields on M, then prove that F,(Ly Y) = Lr oo F,(Y), that is, Ly Y is 
F-related to Lr, (x) F,(Y)- 

11. If feC%(M), X, YeX(M), let Ly f = Xf (cf. (3.2) and (7.6)). Show that 
Ly(fY) = (Ly f¥ + f(Lx Y). 


8 Frobenius’ Theorem 


The concept of vector fields on a manifold can be used to give a 
coordinate-free treatment of certain first-order linear partial differential 
equations which is useful even for local questions in R” and indispensable in 
many global questions. First consider an example. 


(8.1) Example We consider the system of partial differential equations 


© Zoga) =en 

defined on some open subset W of R°. Given (a, b,c) € W, a solution, if any 
exists, determined by (a,b,c) will be a function z = f(x,y) such that 
c = f(a, b) and fx, y) = gx, y, £(% y)), f(x, y) = h, y, f(x, y)). Geometri- 
cally, a solution can be thought of as a surface through (a, b, c) whose tangent 
plane at any of its points has the vectors 


ĉ ê é ô 
x=— y, Z)—, Y = — + k(x, y, z) — 
ay TI z) 3z T (x, y, Z) zz 
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as a basis—these being orthogonal to the normal vector 


to the surface z = f(x, y) above. It becomes clear however that for arbitrary 
independent vector fields X and Y, even of the form above, there may fail to 
be a solution. If there is a solution through any point of W, then X, Ycan be 


written as 
ô ô é 0 
X= 2 + £0 =, Yaa the na, 
and 
z Ore ef \ é 
X = = 
LX, Y] E ôy ôx zl 6z 


This is, then, a necessary condition for an everywhere independent pair of 
vector fields on W to determine a surface whose tangent plane they define. It 
reflects independence of order of differentiation in second derivatives. 


(8.2) Some Definitions Let M be a manifold of dimension m = n + k and 
assume that to each peM is assigned an n-dimensional subspace A, 
of T,(M). Suppose moreover that in a neighborhood U of each pe M there 
are n linearly independent C*-vector fields X,,..., X„ which form a basis of 
A, for every q€ U. Then we shall say that A is a C” n-plane distribution of 
dimension n on M and X,,...,X,, is a local basis of A. 

We shall say that the distribution A is involutive if there exists a local 
basis X,,..., X, in a neighborhood of each point such that 


[X;, XJ = > bX, 1<i, j<n. 
k=1 


[The ci; will not in general be constants, but will be C* functions on the 
neighborhood. ] Exercises 1 and 2 clarify this concept. 

Finally, if A is a C” distribution on M and N is a connected C” 
submanifold of M such that for each q € N we have Tq(N) c Ag, then we shall 
say that N is an integral manifold of A. Note that an integral manifold may be 
of lower dimension than A, and need not be a regular submanifold. 


An example of an involutive distribution is the following: Let M = R"** 
and let A be spanned by X; = 0/éx',i = 1,...,n. Then the distribution is the 
subspace of dimension n consisting of all those vectors parallel to R” at each 
point q of M. We shall see that this apparently rather special example is 
actually typical, locally, of involutive distributions. 
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Let A be a C® distribution on M of dimension n, the dimension of M 
being m = n + k. We shall say that A is completely integrable if each point 
péM has a cubical coordinate neighborhood U, œ such that if x',...,x” 
denote the local coordinates, then the n vectors E; = py ‘(@/éx'), 
i= 1,...,n, are a local basis on U for A. Note that in this case there is an 
n-dimensional integral manifold N through each point q of U such that 
T,(N) = A, i.e., dim N = n. In fact, if (at, .. . , a") denote the coordinates of q, 
then an integral manifold through q is the n-slice defined by gris 
a"+!,... x" =a", that is, N= {xe Q(U)|x! =a, j=n+1,...,m}, a 
slice of U. Of course, in this case the distribution is involutive for 


ô ô 
[E;, E] = oy" E 
We shall call U, ọ flat with respect to A. Thus complete integrability is 
equivalent to every point having a flat coordinate neighborhood. Thus any 
completely integrable distribution is involutive (for a generalization, see 
Exercise 4). However, most distributions are not involutive, for example, on 
R? the distribution 


| =o, 1<i j<n 


ð 6 ô ô 
X =x H X = —— + 
j ôx! ôx? 2 ax?" ax3 
is not involutive since [X,, X2] = —@/@x', which is not a linear combination 


of X, and X,. 

An important and instructive example of an involutive distribution is 
furnished by the Lie algebra h of a subgroup H of a Lie group G; b consists 
of left-invariant vector fields on G which are tangent to H at the identity. As 
we have seen (Remark 7.11), this determines a subalgebra, the image of the 
Lie algebra of H under the inclusion mapping. These give a (left-invariant) 
distribution A on G such that A, = 7,(H) for every he H. The cosets gH are 
the integral manifolds of this distribution—which is evidently involutive 
since h is a subalgebra of g. 

A distribution A of dimension 1 is just a field of line elements, that is, 
one-dimensional subspaces. A local basis is given by any nonvanishing 
vector field X which belongs to A at each point and, of course, an integral 
curve of X is an integral manifold. We know from the existence theorem that 
such integral manifolds passing through any given point exist and are 
unique. In fact, Theorem 3.14 says precisely that any such distribution is 
completely integrable. It is also involutive since [X, X] = 0. In the light of 
these remarks, the following theorem may be considered a generalization of 
the existence theorem (Theorem 4.1) to certain types of partial differential 
equations. In the general case, as we have seen, there is a necessary condition 
which is not automatic—as it is in the case of a one-dimensional distribution. 
This condition is the involutivity of A. 
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(8.3) Theorem (Frobenius) 4 distribution A on a manifold M is com- 
pletely integrable if and only if it is involutive. 


Proof We showed above that a completely integrable distribution is 
involutive. This is an easy consequence of the definitions. We shall prove 
that involutive distributions are completely integrable by first proving what 
appears to be a special case. Assume throughout that m = dim M and 
n= dim A. 


Lemma Let X,,..., X, be C” vector fields defined on an open subset U c R" 
such that (i) {X i4- --, Xna} are linearly independent at each qe U and (ii) 
[X,,X;] = 0o0n U, 1 <i,j <n. Then there is a coordinate neighborhood V, w 
around any pe U such that X; = E; = y '(6/0x'), i = 1,...,n, on V. 


Proof Given pe U, we may assume coordinates (x!,...,x™) in R” such 
that p=(0,..,0) and X;,, =(@/dx')o for i=1,...,n. Let Oila) = O(t, q) 
denote the one-parameter group action generated by X;. Using repeatedly 
both the existence theorem and continuity, we may assume that there exists a 
neighborhood W c U of p and a ô > 0 such that for |t;| < ô and for any 


permutation ø of {1,...,n} the mapping 67,,,,°---° Orem is defined on W. 
Since all [X,, X;] = 0 on U, Theorem 7.12 implies that OR 0+ ++ 0 OM (GQ) = 


0} o--- 6% (q) for any qe W and ce G,, the symmetric group of degree n, i.e., 
all permutations of {1,...,n}. Let W’ c W be a neighborhood of the origin 


such that x = (x!,...,x",x"*1,...,x™)e W’ implies that |x‘| < 6,i=1,...,n. 
We define a C” mapping 0: W’ > R” by 
BES sony X, X") = Ohio o OWO, nnp 0, Pane) 


In other words, we project x to the plane of the last m — n coordinates and 
then act by the various one-parameter groups, i.e., move along the various 
integral curves of the X;. This may be done in any order with the same results. 
Note that 6(0,...,0,x"*1,...,x") = (0,...,0,x"*1,..., x”), i.e., 8 is the iden- 
tity on the (m — n)-slice x' = 0, i = 1,...,n, of W”. 

We now consider 0,(@/0x'|,), ae W'. For given i and a, let F(t) = 
6(a!,...,a'~3,t,a't},...,a™); then 6,(0/éx'|,) = F(a’). First note that for 
i>n and a=(0,...,0,a"*',...,a™), 0,(0/0x') = 0/éx'. In particular, this 
holds at p = (0,...,0). For i < n and any ae W’, 


F(t) = @'[01, o---ofio---oO%,(0,...,0,a"*1,...,a”)], 


where the caret denotes that 6! is omitted. We have used the independence of 
order. Let q denote the point in brackets; then F(t) = G'(g), so F(a’) = 
X oiai. = Xioa.g: Thus ,(6/6x'|,) = Xia, and in particular then 
0,(8/8x'|y) = Xip = 0/0x'|o for i=1,...,n. It follows that 0 is a diffeo- 
morphism on a neighborhood V’ c W’ of the origin and that V = @(V’), 
y = 07! is the desired coordinate neighborhood. 
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We now proceed to the proof of the theorem in two steps. First we show 
that in a neighborhood U of any pe M an involutive n-plane distribution A 
has a local basis of vector fields Y,,..., Y, satisfying [Y;, Y] = 0. We begin 
with an arbitrary local basis X,,... X, defined, we may suppose, on a 
coordinate neighborhood U’, œ of p. Relative to the coordinate frames 
X; ig = V1 Vial) Egg and we may suppose coordinates so numbered that the 
matrix («,{q)), 1 < i,j < nis nonsingular on a neighborhood U c U’ of p. Let 
(B;,(q)) denote its inverse and define a new basis of A on U by Y; ae i 
Bi(QX iq = Pir + ae Vile vgs where Yiv = ae 1 pija w i= hi s1; 
von +1,...,m. Since A is involutive [Y, ¥J=)%-,¢ch) h= Xi- í 

CHE, + ya n+1 YerE;) on U. On the other hand, CY, Y] = LE + dy YEr, 
E, + Vu %in Eg = È r-n: OYE,. Since [E,,E,]=0 for 1<a,b<m. It is 
possible for these two linear combinations of E,,..., E,, to be equal only if 
ck. =0on U for 1 <i, j,k <n, so Y,,..., Y, is the desired local basis. 

The second step of the proof is to simply put things together. Assuming A 
is an involutive n-plane distribution on M and pe M, we choose a coordinate 
neighborhood W, o of p on which there is a local basis Y,,..., Y, such that 
[Y,, ¥,] = 0. In o(W) c R" the vector fields Y; = o,(¥;) have the same prop- 
erty so that by the lemma there exists a cootdinate neighborhood V, w of 
o(p), V = o(W)with coordinates y',...,y" and with Y, = y,'(0/éy'), for 
i= 1,...,n. It is no loss of generality to assume Y(V) = C7(0) and y(o(p)) = 
(0,.. 0. We now define the coordinate r. ,ọ of pon M by 
Ü = o7 1(V) with o = Woo. Then op) = (0,...,0), a0) = C%™(0) and E; = 
a, ow, (@/dy') = o% 1(¥,) = Y, for i =1,...,n. This proves that A is com- 
pletely integrable. 


Theorem 8.3 implies the following corollary which is essentially a local 
uniqueness statement for integral manifolds of an involutive n-dimensional 
distribution A in a manifold M of dimension m. 


(8.4) Corollary Let U, ọ be a flat coordinate neighborhood relative to the 
involutive n-plane distribution A on M. Then any connected integral manifold 
C c U must lie on a single n-slice 

Sa = {qe U|x"*1(q) =a"*?,..., xq) = a"}. 


Proof itis clearly sufficient to show that this holds for any integral curve 
t> F(t)c U, a<t < b, since any two points of C lie on such a curve. By 
definition F(t)€Apq) so F(t) = > 7-1 a(t)Eiro. Consider (d/dt)y(F(t) for 
j> n. We have 


d . d j ~ i l= 
P7 y(F() = FG) = 2 ADE = 0 


since E,y’ = 0 unless i=j. Since the curve is connected, y(F(t)) =a’, a 
constant for j =n + 1,...,m, so F(t) lies on S,. I 
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The following result should be compared with Lemma III.6.7. 


(8.5) Theorem Let N c M be an integral manifold of an involutive distribu- 
tion A with dim N = dim A and suppose F: A > M is a C”? mapping of a 
manifold A into M. If F(A) = N, then F is C” as a mapping into N. 


Proof Let F,:4—N denote the mapping of A into N defined by F. 
Then F =i» F,, where i: N — M is the inclusion map—a one-to-one, C” 
immersion. We are to prove that F, is C®. If pe A, let q = F,(p) = F(p) and 
let U, ọ be a flat coordinate neighborhood of q with respect to A. Since 
i: N—M is continuous, Na U is open in N and thus the union of a 
countable collection of disjoint connected open sets, each—as connected 
integral manifolds—lying on an n-slice of U. Let V be that one which 
contains q. Since dim N = n, V is an open subset of the n-slice through q and 
its image (V) is an open subset of the slice Sy = {ye C(O) y" tt =e. = 
y” =0} of g(U). By choosing £' < €, we may assume that (V) ^ C: (0) 
is all of the n-slice through the origin. Let U'= œ7'(C7(0) and 
V' = V a U. Then U’, g’ = ọ|U' is flat and V’ is an open neighborhood of 
N which is exactly an n-slice of the flat neighborhood U’. Let n denote the 
projection, z(y’,...,y") > (y*,..., y"). Then V’, Y'= zg’ is a coordinate 
neighborhood of q on N. The inclusion i is given in these local coordinates by 
EOI... VIO"... y", 0,..., 0). 

Now let W, 8 be a connected coordinate neighborhood of p on A such that 
F(W) c U’. Since F(W) = ic F\(W) c NCU, it must lie on a single slice; 
but qe F(W), so this is the slice V’. In local coordinates (x',..., x’) on W, 
F(x!,...,x") = I,- £700), 0,...,0) with fi being C”. Thus F, in local 
coordinates of W, @ and V’, y’ is given by F,(x',..., x") = (f'(x),..-, f(x). 
It follows that F, is C” on the neighborhood W of p. Since p was arbitrary 
this proves the theorem. I 


(8.6) Definition A maximal integral manifold N of an involutive distribu- 
tion A is a connected integral manifold which contains every connected 
integral manifold which has a point in common with it. 


It is immediate from Corollary 8.4 that a maximal integral manifold has 
the same dimension as A. It is also clear that at most one maximal integral 
manifold can pass through a point p of M. It is true but more difficult to 
prove that there is a maximal integral manifold through every point of M. 
The idea is to piece together local slices using Corollary 8.4 and build up an 
immersed submanifold. The difficulty is in showing that there are not too 
many such slices, that is, in proving that we have a countable basis of open 
sets. We shall not prove this here. Proofs are given by Chevalley [1] and 
Warner [1], for example. We shall state one theorem whose proof requires 
this fact and show how it is used. 
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(8.7) Theorem Let G be a Lie group, g its Lie algebra, and h a subalgebra of 
g Then there is a unique connected subgroup H of G whose Lie algebra is b. 


Proof Let the left-invariant vector fields X,, ..., X,, on g be a basis of b. 
They define an involutive distribution A which is invariant under left 
translations; hence each integral manifold N is carried by any left translation. 
L, diffeomorphically to an integral manifold L,(N). Let H be the maximal 
integral manifold through the identity element e. If he H, then L,_,(h) = e so 
that both H and L,- ,(H) have e in common. Since H is maximal, L,- (H) = 
H. It follows that if h,, ha€ H, then hy th, €H and H is thus a subgroup as 
well as an immersed submanifold. The product mapping H x H>H isa 
composition of inclusion i:H x H+G x G and the product mapping 
6:G x G > G. Both are C” so that @ ci is C” as a mapping into G. Its image 
is in H because H is a subgroup; and by Theorem 8.5 we see that the product 
mapping H x H > H is also C®. A similar argument shows that the mapping 
taking each heH to its inverse h | is also C”. This completes the proof, 
subject to the unproved assertion concerning integral manifolds. I 


An involutive n-plane distribution A on a manifold M defines what is 
known as a foliated structure or foliation on M with the maximal integral 
manifolds as leaves. Thus M = |) pem Np, Np the maximal integral manifold 
through p and locally the flat neighborhoods determine diffeomorphisms to 
R™ = R" x R*, n+ k = dim M, with leaves corresponding to submanifolds 
R” x {(a"*?,...,a”)}. If n = 1, the leaves are just integral curves of a vector 
field. Foliations have many interesting properties (see Haefliger [1] and 
Lawson [1]) since the existence of a foliation is equivalent to the existence of 
a rather special type (involutive) of k-plane distribution. More precisely, if a 
foliation is given, tangent spaces to leaves determine an involutive distribu- 
tion and conversely. 


Exercises 


1. Show that if a distribution is involutive with respect to one choice of 
local basis on an open set U, the same will be true for any local basis 
whose domain is in U. 

2. A vector field X is said to belong to a distribution A if for each p e M we 
have X,¢A,. Show that a C® distribution is involutive if and only if for 
every pair of C®-vector fields X, Y on M which belong to A the vector 
field [X, Y] belongs to A. 

3. Let N be a maximal integral manifold of a distribution A on M. Show 
that if N is closed (as a subset), then it is a regular submanifold of M. 

4. Let N = M bea submanifold, and let X, Y e X¥(M) such that if pe N, then 
Xp Y,€ TN). Show that peN then implies [X, Y], € T,(\). 

5. Let F: M >N bea C” submersion of M onto N. Show that F~‘(q) for 
all ge N are the leaves of a foliation of M. 
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The following problems are crucial steps in the proof of the existence of 
maximal integral manifolds of an involutive distribution A on M. Assume 
that dim A = n. 


6. Let N,, N, be integral manifolds of A with dim N, =n = dim N.. If 
N,N, # Ø, then N = N, UN, is an integral manifold with N, and 
N, as open submanifolds. 

7. M may be covered by a countable collection of flat coordinate neighbor- 
hoods {U,, ¢;}. If N is a connected integral manifold, dim N = n, then the 
set { )(N ^ U) is a countable collection of n-slices of the U;. 

8. Continuing the notation of Exercise 6, show that ~ is an equivalence 
relation on M: p ~ q if there is a finite chain S,,...,S, of n-slices of flat 
neighborhoods of the collection {U;, g;} such that peS,, geS,, and 
S,;0S;4; # @,i=1,...,r— 1. [The equivalence classes can be shown 
to be maximal integral manifolds. The hard part is second countability. ] 


9 Homogeneous Spaces 


In this section we consider the action of a Lie group on a manifold in a 
special but very important case, transitive action. Let 0: G x M > M 
denote such an action. Then we recall that it is transitive if for every pair 
p,qeéM, there is a geG such that 0,(p) = q. This means that as far as 
properties preserved by G are concerned, any two points of the manifold are 
alike. 


(9.1) Definition A manifold M is said to be a homogeneous space of the 
Lie group G if there is a transitive C” action of G on M. 


Many examples of group action have this property: O(n) acts transitively 
on S"~ +, Gl(n, R) acts transitively on R” — (0) and so on; these were discussed 
in Section JJI.7. But one of the most important examples remains to be 
treated, since until this moment we have lacked an essential tool: Frobenius’ 
theorem. This example, viewed first from a purely set theoretic standpoint, 
is the following: Let G be a group, H any subgroup, and G/H the set of left 
cosets. We define a left action 4:G x G/H — G/H by A(g, xH) = gxH; it is 
a left action since 


(i) Ale, xH) = xH, and 
(ii) Agi, A(g2, xH)) = Agi, 92 xH) = (91 92)xH = Algı g2, xH). 
Moreover, if n: G > G/H is the natural mapping of each ge G to the 
coset which contains it, z(g) = gH and if L,: G > G denotes left translation, 
then we have the property: 
(iii) mo L, = Ayo x (for all ge G). 
The transitivity is apparent: A,,-1(xH) = yH for all x, ye G. 
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Of course, we are far from being able to assert that when G is a Lie group, 
then G/H is a manifold and the mappings 4 and z defined by G and H are 
C”. We did see, however, that if H is closed in G (a Lie group), then the 
quotient topology on G/H makes it a Hausdorff space and x an open—as 
well as continuous—mapping (Theorem III.7.12). We left it as an exercise to 
the reader to show that with this topology on G/H, / is a continuous group 
action. In this section we shall go further and show that G/H is a manifold 
and A is a C” action. Aside from the fact that this will give us many new 
examples of manifolds and group action, this is important for another 
reason: the manifolds G/H with G acting by left translation form a universal 
model for all transitive actions, that is, for all homogeneous spaces. 

First, consider this last statement from the set-theoretic viewpoint— 
without topology or differentiable structure. Let X be a set on which a group 
G acts transitively by the rule 0: G x X — X. Choose, arbitrarily, a point 
ae X and let the isotropy subgroup (or stability group) of a be H, 


H = {ge G | 0,(a) = a}. 


We then define a mapping F: G > X by F(g) = 0,(a). Since 0 is transitive, F 
is onto; moreover for any xe X, F~ +(x) = gH, where g is any element of G 
such that F(g) = x. It is then easily verified that F induces a one-to-one onto 
mapping F: G/H > X by F(gH) = F(g). For these mappings we have the 
relation F o x = F. Finally F carries the natural action of G on G/H, which 
we denoted by 4 above, to the action 0, that is, 


Fol,=0,°F 


for every ge G. Thus from the set-theoretic and abstract group viewpoint, 
A: G x G/H > GJH is equivalent as an action to 0: G x X > X. 

Of course, it is very interesting and important to see to just what extent 
this still holds in the case of the transitive action of a Lie group on a 
manifold. Recall that by Definition III.6.17 a Lie subgroup H of a Lie group 
G is an immersed submanifold which is a Lie group with respect to the group 
operations of G. Since we shall use the quotient topology on G/H, we must 
restrict our attention to those Lie subgroups that are closed subsets if G/H is 
to be a Hausdorff space (Theorem III.7.12). Therefore H will be assumed to 
be a closed Lie subgroup. [We prove later that then H is a regular 
submanifold of G (compare Remark III.6.19).] A section V, o on G/H will 
mean a continuous mapping o of an open subset V of G/H into G, o: V > G, 
such that z o ø is the identity on V. We then have the following basic fact. 


(9.2) Theorem Let G be a Lie group and H a closed, Lie subgroup. Then 
there exists a unique C*-manifold structure on the space G/H with the proper- 
ties: (i) z is C” and (ii) each g e G is in the image o(V) of aC™ section V, o on 
G/H. The natural action 4: G x G/H —> G/H described above is a C™ action 
of G on G/H with this structure. The dimension of G/H is dim G — dim H. 
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Now suppose that a Lie group G acts transitively on a manifold M, the 
action being given by the C®-mapping 0: G x M — M. Using the notation 
above, with X replaced by M, we suppose ae M and that H is the isotropy 
subgroup of a. We then have the following closely related theorem to com- 
plete the picture. 


(9.3) Theorem The mapping F: G > M, defined by F(g) = 0(g, a), is C” 
and has rank equal to dim M everywhere on G. The isotropy group H is a 
closed Lie subgroup so that G/H is a C” manifold. The mapping F: G/H > M 
defined by F(gH) = F(g) is a diffeomorphism and F o 4, = 0, F for every 
gEG. 


Before proving these theorems we give some examples of their use. First 
consider briefly some of the spaces associated with classical geometries: 
E"—Euclidean space, P"(R)—the space of real projective geometry, and 
H?—the space of plane non-Euclidean geometry. All of these were dis- 
covered and studied before Lie groups (or groups of any kind) were in- 
vented. However, in each case there is an underlying group, the group of 
automorphisms of the geometry; it is the group by which we can bring 
congruent figures into congruence. In fact each geometry studies precisely 
the objects and properties which are invariant under the transformations of 
this group acting on the space. For E", or R", the group consists of all 
isometries (rigid motions): translations, rotations, and reflections; for P"(R) 
it is the projective transformations; and for H? it is the group which leaves 
non-Euclidean distances unchanged (“rigid” motions again!) In each case 
the group is a Lie group and in each case it is transitive. This means that the 
theorems above can be used as a sort of underlying unifying principle of all 
these geometries, a fact which was recognized by F. Klein [1] and resulted in 
a famous approach to geometry called the “Erlangen Program” (1872). 
Thus the study of any of these classical geometries can be reduced to a study 
of Lie groups G and their subgroups H. This point of view pervades much of 
modern geometry. Consider now what G and H are for the cases above. 


(9.4) Example We have seen in Example III.7.6, that the group of rigid 
motions of E", identified with R”, is a group G which is O(n) x V" as a 
manifold, but whose group product was defined by (A, v)(B, w) 
= (AB, Bv + w) and whose action on R” is given by (A, v): x= Ax +0 
(see Exercise III.7.6). Another approach is the following: we identify G with 
the (n + 1) x (n + 1) matrices of the form 


: : : Ai `? Ain 
g= ae ; A= : : | € O(n), 


Qnt H Ann 
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and points x = (x!, ..., x") of R” with the column vector X = ‘(x',..., x", 1). 
Then 0(g, x) = gx, the product of the matrices g and x. The subgroup H 
leaving the origin x = (0, ..., 0) fixed is the set of all of these matrices for 
which v; =+: = v, = 0; hence it is a closed Lie subgroup isomorphic to 
O(n). 


(9.5) Example The group G = Si(n + 1, R) acts transitively on P"(R) as 
follows: let [x] ¢ P"(R). Then [x] is an equivalence class of nonzero elements 
x = (x'......x"*') of R"*!. Given any ge Si(n + 1, R), we define 6(g, [x]) by 


O(g, [x]) = lax], 
where gx is the matrix product of g with x written as a column vector 
((n + 1) x 1 matrix). This is a C” action and is transitive; the proof is left to 
the exercises. The isotropy subgroup H of [(1, 0, ..., 0)] is the set of elements 
(a;;) of Si(n + 1, R) with a,, #0 and all other entries of the first column 
equal to zero. It is easily seen that H is a closed, Lie subgroup of G. 


The non-Euclidean (or hyperbolic) plane H? will be discussed in the last 
chapter. It is equivalent to Si(2, R)/SO(2) in the sense of theorem 9.3. 

One of the more important uses of these ideas and of Theorem 9.2 is the 
relatively simple way it provides for establishing that certain sets are C* 
manifolds in a natural way. The best illustrations are the Grassman mani- 
folds G(k,n) of k-planes through the origin in R". It was proved in 
Section III.2 that these were manifolds, but the proof was quite complicated 
and only sketched at some points. This same result may be shown as follows. 
The group Gi(n, R) acting in the natural manner on R” is transitive on 
k-planes through the origin. This is an immediate consequence of the fact 
that it is transitive on n-frames: given {v,,..., v,} any linearly independent 
set of vectors, then there is a uniquely determined, nonsingular, linear trans- 
formation taking it to any second linearly independent set {w,,..., W,}- 
However, if Gl(n, R) is transitive on n-frames, it is necessarily transitive on 
k-frames since each set of k linearly independent vectors can be completed to 
a basis. It follows that Gi(n, R) acts transitively on the set M = G(k, n) of 
k-planes through 0. If the isotropy subgroup H of some point of M, that is, a 
k-plane through 0, is a closed Lie subgroup, then Gi(n, R)/H is a C” mani- 
fold by Theorem 9.2 and is in natural one-to-one correspondence with M. 
Thus we may take on M the topology and C® structure which makes this 
correspondence a diffeomorphism. However, H is such a subgroup, for the 
k-plane of R" spanned by the vectors e, =(1,0,...,0), ..., 
e, = (0,..., 1,0,..., 0) is carried onto itself by the subgroup H  Gi(n, R) 
consisting of matrices of the form 


(35) 
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where AeGi(k, R), BE Gl(n — k, R), and C is an arbitrary k x (n — k) 
matrix. Therefore the Grassmann manifold G(k, n) is indeed a C* manifold. 
This method is frequently used in practice to show that some rather com- 
plicated objects can be endowed with the structure of a differentiable mani- 
fold (uniquely, according to Theorem 9.3). It may be summarized as follows: 


(9.6) If G is a Lie group and G acts on a set X transitively in such a way that 
the isotropy subgroup of some point a of X is a closed Lie subgroup, then there 
exists a (unique) C* structure on X such that the action is C™. 


This principle as well as other results of this section are susceptible to 
further refinements and weakening of hypotheses (see, for example, Helga- 
son [1, Chapter II, Sections 3 and 4]). Of course, our treatment above of 
G(n, k) depends on Theorem 9.2, which we are now ready to prove. 


Proof of Theorem 9.2 The topology on G/H is given; it is uniquely 
determined by the requirement that x: G > G/H be open and continuous. 
Moreover 4: G x G/H > G/H is a continuous action. For let U be an open 
set of G/H, then we will show that 4~1(U) is open. Let W be the subset of 
G x G such that every pair (gı, g2)€ W has its product g, g3 in x~'(U), 
which is an open subset of G. W is open since it is the inverse image of 
x (U) under the continuous mapping (g;, g2) > gı g2. The natural map- 
ping of G x G > G x G/H given by (9,, g2) > (g1, 2(g2)) is open so it car- 
ries W onto an open set which is exactly A~'(U). 

We now need to use Frobenius’ theorem, which we apply to the left- 
invariant distribution A determined by A, = T,(H). As a basis A has any 
basis of left-invariant vector fields in h, the Lie algebra of H viewed as a 
subalgebra of g; and the integral manifolds of A are exactly the left cosets 
gH—as remarked in the previous section. It follows that there is a cubical 
neighborhood of e whose intersections with the cosets gH are a union of 
slices. To complete the proof we need a sharper result given by the following 
lemma, which is important in its own right. 


(9.7) Lemma If H is a connected Lie subgroup of G which is closed as a 
subset, then each coset gH is a closed, regular submanifold, and there is a 
cubical neighborhood U, @ of any geG such that for each coset xH either 
xH ^ U is empty or a single (connected) slice. 


Proof That H and each of its cosets is a regular submanifold is an 
immediate consequence of the second part of the statement, which asserts, in 
particular, that H and its cosets have the submanifold property (Definition 
III.5.1). Since each coset is a maximal integral manifold of the distribution 
A = 5, as we saw in the previous section, every ge G has a flat coordinate 
neighborhood U, ø, whose slices—determined by fixing the last m — n 
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coordinates (n = dim H, m = dim G)—are integral manifolds, each an open 
set of a coset xH of H. We must now verify that U may be taken sufficiently 
small that for each x, xH ^ U is empty or is a single slice. Since A, integral 
manifolds, etc., are invariant under left translation by elements of G it is 
enough to check this for the special case x = e. If U’, g’ isa flat neighborhood 
of e, and if U’ ^ H consists of a single slice, then we need only choose U c U’ 
small enough that UT +U c U’ and so that U, ọ = g'|U is still a cube in order 
to complete the proof. For if x, y € U are on distinct slices of U but belong to 
the same coset, that is, xH = yH, then y~'x and e are elements of U’ ^ H but 
lie on distinct slices (because L, _ , is a diffeomorphism and carries slices into 
slices). Since this contradicts our assumption about U’, it cannot happen. 
Thus it remains to show the existence of U’, ọ'. We begin with an arbitrary 
flat neighborhood V, y of e, Y(V) = C70), whose slices S(a"*?,...,a™) = 
{qe V|x(q) =a, j=n+1,...,m} are integral manifolds. We saw in the 
proof of Theorem 8.5 that the collection of distinct slices on H, that is, V m H, 
is countable and hence corresponds to a countable set of points 
{(a"*+,...,a™)} of the cube C7~"(0). Restricting slightly to a closed cube 
V’ = W~1(C%(0)), 6 > 5’ > 0, we may suppose this countable set is closed, for 
H is closed and V’ ^H is closed. Since a closed countable subset of R"~" 
must contain an isolated point (Exercise 8) it follows that H ^ V’ contains an 
isolated slice. By translation invariance we may suppose this to be the slice 
through e. Then it is possible to choose z’, 0 < &' < 0’, so that y~ 1(C™(0)) = 
U' and ¢’ = y |U’ have exactly the property we have seen is needed: H ^ U’ is 
a single slice and contains the identity e. This U’, ọ' as we have seen enables 
us to complete the proof of the lemma.t | 


Resuming the proof of Theorem 9.2, we restrict our discussion entirely to 
cubical neighborhoods U, ¢ of the type described above with g(U) = C7'(0) 


and suppose that in the local coordinates x’, ..., x", x"*1,..., x™, the slices 
obtained by holding x"*',..., x” fixed are the intersections of cosets gH 
with U. Let A = {qe U |x"(q) = ++: = x"(q) = 0} and 


yi A~C—"0) eR 


be defined by ’(q) = (x"*'(q), -.., x(q)); A is a C? submanifold of G, 
contained in U, and yw’ is a diffeomorphism. By our choice of U, p we see that 
A meets each coset of H which intersects U in exactly one point. Therefore, x 
maps A homeomorphically onto an open subset V of G/H. We denote the 
inverse by a; thus ø: V > G is a continuous section with o(V) = A. Suppose 
that U, ọ and U, @ as just chosen are such that V = n(A) and V = x(A) 


+ As remarked earlier (compare Remark III.6.19), the conclusion follows from the much 
weaker hypothesis: H is an algebraic subgroup and a closed subset of G. Moreover, our 
assumption that H is connected is not necessary (see Exercise 9). 
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have common points. The set V ^ V is open and it is not difficult to verify 
that the corresponding subsets W = o(V ^ V) and W = G(V 17) are dif- 
feomorphic with respect to the natural correspondences 6 o n: W > W and 
co n: W—W (Exercise 4). It follows that the collection of open sets 
V = n(A) over all U, g of the type above together with the homeomor- 
phisms y = y'o ø: V > C} "(0) determine a C® structure of the type 
required by the conclusions of the theorem. The uniqueness follows from 
requirements (i) and (ii). For if we have two differentiable structures on G/H, 
we see that the identity is a diffeomorphism as follows: factor it locally into a 
section o: V > G of the first structure followed by projection z, which is C” 
onto the second structure. Thus the identity is a C” mapping of G/H with 
structure one to G/H with structure two since this holds on each domain V. 
But the converse is also true, so the structures agree. Finally 4:G x G/H 
> GJH is C® since it may be written on the domain V of a section as 
A(g, xH) = n(go(x)). This completes the proof of Theorem 9.2. i 


We now prove the second principal result. 


Proof of Theorem 9.3 F:G— M is C® since F(g) = 0(g, a) and 0 is C” 
by assumption. From 


F o L(x) = F(gx) = 0, F(x), 


from the chain rule, and from the fact that both L, and @, are diffeomor- 
phisms we see that the rank of F is the same at every g e G. It follows that 
F~‘(a) = H is a closed submanifold (Theorem III.5.8) and satisfies the 
hypotheses of Theorem 9.2. At e we have F,: T,(G) > T,(M); but each 
X€ T,(G) is the tangent vector at t = 0 to the curve g(t) = exp tX so that 
the vector F,(X,) is the tangent vector to F(exp tX) = O(exp tX, a) at a 
(which corresponds to t = 0). Since 8 restricted to g(t) = exp tX is an action 
of R on M, then by Theorem 3.6 F,(X,.) is zero if and only if 
(exp tX, a) = a for all t, that is, exp tX c H, or X e T,(H) the subspace of 
T,(G) corresponding to the subgroup H. Hence ker F pe = TAH) = Ker Tye 
and, as noted, dim ker F, is constant on G as is dim ker z, . Since F is onto, 
it follows from Theorem II.7.1 that dim M = dim G — dim H = dim G/H. 

Now consider F: G/H —> M. Let q€ G/H and V, ø a section defined on a 
neighborhood V of q. Since ø is C” and F | V = F o ø, we see that F is C” in 
a neighborhood of every point, hence C” on G/H. We know that F is 
one-to-one and onto from set-theoretic considerations and if ker F, = {0}, 
that is, rank F = dim G/H = dim M everywhere, then F must be a diffeo- 
morphism. Let q be any point of G/H and suppose q = x(g). Then using 
F = Fo mand the chain rule we see that F,: T,(G) > Tho (M) is given also 
by F, ° 2,. Since dim ker F, = dim ker z,, we must have dim ker F, = 0, 
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as we wished to prove. The fact that F ° 1, = 0,° F was already noted; both 
4, and 0, are diffeomorphisms. the by ‘Theorem 9.2 and the latter by 
hypothesis. This completes the proof of Theorem 9.3. I 


Exercises 


1. Verify in detail statements (i)-(iii) at the beginning of the section con- 
cerning the action of G on G/H and that if G acts transitively on a set X, 
then the mapping F:G/H —> X defined above is in fact one-to-one, onto, 
and satisfies F o A, = 0, © F as claimed. 

2. Check that the isotropy subgroup H c Gi(n, R) defined in discussing 

G(n, k) is, in fact, closed and a Lie subgroup. 

3. A sequence of n ee of R", V= R" sa Vy So - > Va with 
dim V, = j, j= 1, ..., n, is called a flag of R”. Verify that the natural 
action of Gi(n, R) on R” is transitive on the set of flags M and use this to 
obtain the structure of a C* manifold on M. What is dim M? 

4. Verify that the sets W, W corresponding to the overlapping part V ^ V 
of the domains of two sections V, ø and V, é as defined in the proof of 
Theorem 9.2 are diffeomorphic. [Note that translations on G by ele- 
ments of H are diffeomorphisms and have the property that z o R, = 7. 
They may be used to bring a pair of corresponding points p, p of W, W 
into coincidence. ] 

5. Let G bea Lie group and H a closed Lie subgroup which is normal in G. 
Then show that G/H is a Lie group with the C® structure of 
Theorem 9.2 and z: G > G/H is a Lie group homomorphism. 

6. Let G, and G, be a Lie group and F: G, > G, a Lie group homomor- 
phism. Then show that the kernel of F is a closed Lie subgroup and if F 
is onto, then G, = G,/ker F. 

7. Show that the subgroup O(n) of Gi(n, R) acts transitively on the Grass- 
mann manifold G(k, n) and find the isotropy subgroup of the k-plane in 
R” spanned by e}, ..., ex, the first k vectors of the standard orthonormal 
basis (compare the remarks preceding Example III.7.4). 

8. Prove that a closed countable subset of a Euclidean space R* has an 
isolated point. [Hint: Use the Baire category theorem. ] 

9. Let H bea Lie group and let Ho denote the component of e. Show that H 
has at most a countable number of connected components, all both open 
and closed and all diffeomorphic to Hy, and show that Hp is a Lie 
subgroup. 


Appendix Partial Proof of Theorem 4.1 


Proof (Part (I) of the existence theorem (Theorem 4.1) for ordinary 
differential equations) We are given n functions f(t, x) defined and of class 
C on an open subset J, x U c R x R", I, = {~e < t < e, £ > 0}. We must 
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show that for each x e U there is a neighborhood V and a ô > 0 such that for 
each ae V there exist unique functions x'(t), —d < t < 6 satisfying 


(+) = f'(t, x(t)) 
and 
(«x) x'(0) = a, i=1,...,n. 


First note that if x(t), i = 1,..., n, are continuous functions defined for 
|t| <6 and they satisfy 


x(t) =a + [re x(t)) dr, 


then by the fundamental theorem of calculus they are of class C! at least and 
satisfy both (*) and (««). From (x) it then follows that they must be of class 
C't! at least, since their derivatives are of class C’. 

We may write this set of integral equations for x1(¢),..., x"(t) as an 
equation in n-tuples 


x(t)=a + [ie x(t)) dt. 


For a given x, € U we choose r,0 < r < 1 such that B3,(x9) =€ U and an ¢' 
satisfying ¢ > £’ > 0, so that T, c I,. Thus f(t, x) are of class C", r > 1, on 
the compact set I, x B3,(xo); therefore both the given functions f‘ and their 
derivatives are bounded on this set. It follows that we may choose M > 1 
such that both M > sup || f(t, x)|| and Ml/x — y|| = || f(t, x) — f(t, y)|| for 
all te J,, and x, ye B3,(x9). The last inequality results from the mean value 
theorem and the continuity of the derivatives. Choose a positive 6 such that 
ô < r/M?. 

We shall prove the theorem with this 6 and with V = B,(xọ)—denoted B, 
in what follows. Let a € B, and let ¥ be the collection of all continuous maps 
p(t) = (g'(t), -.-, o"(t)) of Is into B,,(a) satisfying (0) = a. By virtue of the 
preceding comments it is enough to show that there is a unique member of 
this collection satisfying 


o = L(ọ)=a + |se g(z)) dr 


in order to finish the proof. This will be done by proving that L: F > F isa 
contracting mapping on a complete metric space and applying 
Theorem II.6.5; we have: 


(1) F is a complete metric space with d(ọ, Y) = sup, <7, lolt) — W(t)|| 
since this is the topology of uniform convergence of continuous functions on 
a compact space. 
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(2) IfoeF, then L(y)€F so that L maps F to F. It is clear that 
L(q) is continuous; in fact, it is at least C’, and when t = 0 the function 
L(g) has the value a. It is only necessary to check that if |t| < ô, then 
|| L(@)(t) — al] < 2r. This results from 


| Lee) — a] = < ji f(t, o(t))|| dt < Mô < a <r. 


| 1600) 
(3) Finally, L is contracting. Let ọ, Y E€ F, 
ILo) = LOL < | IE 00) -AE Ye) a 


< òM sup|g(t) — v(e) 


tels 


< ôM dlo, Y) = 3; dlo, Y): 


Butr < 1 and M > 1 so that we have 
|L(o) — LW) < k diy), where O<k<1. 


By the contracting mapping theorem there is a unique g(t) satisfying the 
conditions. This completes the proof. 


Notes 


For those readers who wish to delve somewhat further into some of the important topics 
taken up in this chapter and, especially. to find complete proofs of the basic existence theorem 
for systems of ordinary differential equations, the existence of maximal integral manifolds for 
involutive distributions, and more details on homogeneous manifolds, a comment on some of 
the references may be helpful. 

Theorem 4.1 and the material of Sections 3-5 may be found in many places. A concise and 
very straightforward treatment of this theorem and related material is to be found in the book 
of Hurewicz [1]. It is also very elegantly treated (as is Frobenius’ theorem and the inverse 
function theorem) in Dieudonné [1, Chapter X]. Although it is well along in the book, the basic 
ideas can be followed without reading through all of the previous chapters. For a treatment 
which is specially adapted to differentiable manifolds (including local one-parameter group 
action) and is beautifully done see Lang [1]; the author found all of these sources very helpful. 
Both Dieudonné and Lang treat the subject from a very general point of view, that of Banach 
spaces and manifolds modeled on them. Although this may disturb some readers, it will appeal 
to others. In any event. it is not difficult to reduce the level of generality—the ideas and proofs 
are very clear. Other sources are Hirsch and Smale [1] and Arnold [1]. 

For Frobenius’ theorem, particularly from the global point of view, as well as other topics 
such as Lie groups and the fundamentals of homogeneous spaces, every reader should be 
acquainted with the classical book by Chevalley [1]. which greatly influenced all subsequent 
treatments. The exposition here was very much influenced by the proof of W. L. Chow. For 
references, a very instructive proof, and a useful generalization the reader is urged to consult 
Sussman [1]. The book by Warner [1] and the book of Spivak [2] should also be helpful to the 
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reader who wants to fill in gaps or just to read another (and more complete) treatment of the 
material of Sections 8 and 9. Finally, the books by Helgason [1] and Kobayashi and Nomizu [1] 
go much further into the ramifications of subjects treated here (especially Section 9). Many of 
these books have rather complete bibliographies from which the reader can search out further 
sources. The theorem of Cartan on closed subgroups is proved in Chevalley [1], Helgason [1], 
and Hochschild [1], and Adams [1]. 
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With some minor exceptions this chapter contains all of the basic material on tensor fields 
on manifolds which is used in the succeeding chapters. We limit ourselves to discussing covar- 
iant tensors since we will rarely need any other type—except vector fields; any other cases will 
be developed as needed. 

A covariant tensor on a vector space V is simply a real-valued function O(v,,..., ¥,) of 
several vector variables v,,...,¥, of V, linear in each separately (that is, multilinear). The 
number of variables is called the order of the tensor. A tensor field ® of order r ona manifold M 
is an assignment to each point pe M ofa tensor ®, on the vector space T,(M) which satisfies a 
suitable regularity condition (C°, C’, or, C*) as p varies on M. Sections 1 and 2 discuss the two 
simplest examples: r = 1 corresponding to functionals on a vector space and r = 2 correspond- 
ing to bilinear forms on a vector space. The latter includes the important case of a Riemannian 
metric, which is a covariant tensor field ® of order 2 with the property that ®, is an inner 
product on 7,(M) for every pe M. It is the added structure given by such a tensor which enables 
us to measure distances, angles, volumes, and so on, on a manifold M and thus to carry over 
large portions of Euclidean geometry to abstract manifolds. This concept was foreshadowed in 
the work of Gauss for surfaces, but was discovered and expounded by Riemann for whom the 
metric tensor was named. Many of the consequences of Riemann’s discovery are treated in 
Chapters VII and VIII; only one or two of them here. In particular, in Section 3 we show thata 
Riemannian metric gives rise to a metric d(p, q) on M; and in Section 7 we show that it defines a 
volume element on M. 

In Section 4 the basic notion of partition of unity is introduced. This is an indispensable 
tool used to piece together functions, mappings, forms, vector and tensor fields and other 
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objects, which can be defined locally on a coordinate neighborhood (that is, on an open subset 
of R”), to obtain a globally defined objects on M. The first application is to prove the existence 
of a Riemannian metric on any C” manifold. Several other applications are given here and in 
the remainder of the chapter. 

In Section 5 covariant tensor fields are discussed in more generality and in particular we 
treat the case of symmetric and alternating tensors—tensors which are unchanged (respectively, 
change sign) when two variables are interchanged. Of these two, the alternating are the most 
important to us since they correspond to exterior differential forms which we use frequently in 
the next chapters. In Section 6 we see that tensors can be added and multiplied (like any 
functions to the real numbers) resulting in an algebra of tensors. A slight modification of the 
product gives the exterior product of alternating forms. When we add to this the basic notion of 
derivatives of such forms, which is defined in the last section, we have all of the basic ingredients 
for the calculus of exterior differential forms on M. The exterior forms on M form an algebra 
A(M) on which differentiation is a linear operator. As we shall see, this algebra plays a basic role 
in the geometry of manifolds. 

(Sections 6-8 are used in an essential way in Chapter VI, but much of Chapter VII and 
parts of Chapter VIII may be read without knowledge of differential forms.) 


1 Tangent Covectors 


In this chapter we suppose that V is a finite-dimensional vector space 
over R and let V* denote its dual space. Then V* is the space whose elements 
are linear functions from V to R; we shall call them covectors. If o e V*, then 
o: V > R, and for any ve V, we denote the value of o on v by a(v) or by 
<v, «>. Both notations are useful. Recall that addition and multiplication by 
scalars in V* are defined by the equations 


(0, + o2)(v) = a,(¥) + o2(¥),  («0)(v) = a(a(v)), 


giving the values of 0, + c, and ao, «e R, on an arbitrary ve V, the right- 
hand operations taking place in R. 

Knowledge of linear algebra is assumed, but by way of review we men- 
tion three frequently used facts. 


(i) If F,: V— W ıs a linear map of vector spaces, then it uniquely 

determines a dual linear map F*: W* — V* by the prescription 

(F*o)(v) = o(F,(v)) or <v, F*(o)> = <F,(v), o>. 
When F, is injective (surjective), then F* is surjective (injective). For 
composition of linear maps, we have (G, ° F,)* = F* o G*. 

(ii) Ife,,...,e, is a basis of V, then there exists a unique dual basis 
a", ..-, 0" of V* such that w'(e,) = ô}. (The symbol ô$ is zero if i + jand +1 
if i = j.) 

If ve V, then w'(v),...,@"(v) are exactly the components of v in the basis 


e,,-..,e,- In other words v = )%_, a (v)e;. This is a consequence of the 
preceding definitions (see Exercise 1). 
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Observe that in property (i), the definition of F* does not require the 
choice of a basis; therefore F* is naturally or canonically determined by F,. 
According to (ii), the vector spaces V, V* have the same dimension, thus 
they must be isomorphic. There is no natural isomorphism; however, we do 
have the following property: 


(iii) There is a natural isomorphism of V onto (V*)* given by 
v > <v,.), that is, v is mapped to the linear function on V* whose value on 
any ae V* is <v, o>. Note that <v, o> is linear in each variable separately 
(with the other fixed). 


This shows that the dual of V* is V itself, accounts for the name “ dual” 
space, and validates the use of the symmetric notation <v, o> in preference to 
the functional notation o(v). We shall see in the next section that when 
further structure is assumed, for example, an inner product on V, then there 
is an associated natural isomorphism of V and V*; thus V, V*, and V** can 
all be identified in this case. This is apt to be more a source of confusion than 


joy. 


Covectors on Manifolds 


Let M be a C® manifold and assume pe M. We denote by T$(M) the 
dual space to T,(M), thus ,€ T}(M) is a linear mapping o,:T,(M)>R 
and its value on X,,€ T,(M) is denoted by o (Xp) Or (Xp, Fp». Given a basis 
Ej ps -<-> Enp Of T,(M), there is a uniquely determined dual basis œp, ..., @p 
satisfying, by definition, wi(E,,) = ô}. The components of a, relative to this 
basis are equal to the values of ø, on the basis vectors Ej ps+++) Enp, thus 


Tp = 2 6 (Eip)p: 


this is the dual statement to property (ii) above. 

Just as we defined a vector field on M, so may we define a covector field: 
It is a (regular) function ø, assigning to each pe M an element o, of T3(M). 
As with vector fields, we denote such a function by ø, 4, ... and we denote by 
Gp» eee UC value at p, that is, the element of T*(M) assigned to p. Ifa isa 
covector field and X is a vector field on an open subset U of M, then a(X) 
defines a function on U: to each pe U we assign the number a(X)(p) 
= 0,(X,). [Note: We often write o(X,) for ¢,{X,) if o is a covector field]. 


These remarks enable us to state the formal definition. 


(1.1) Definition A C’-covector field o on M,r > 0, is a function which 
assigns to each pe M a covector o,€ T*#(M) in such a manner that for any 
coordinate neighborhood U, g with coordinate frames F,, ..., E,,, the func- 
tions o(E,), i= 1,...,n, are of class C" on U. For convenience, “ covector 
field” will mean C®”-covector field. 
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We remark that the following (apparently stronger) regularity condition 
could be used to replace the requirement of the definition—thereby avoiding 
the use of local coordinates. 


(1.2) Suppose that o assigns to each pe M an element o, of T*(M). If o is of 
class C", then for any C®-vector field X on an open subset W of M the function 
a(X) is of class C on W, and conversely. 


To see this we take a covering of W by coordinate neighborhoods of M 
(whose domains are in W); let U,@ be such a neighborhood. Then 
X = }_ dE; on U, where a! are C” on U. Thus o(X) = Y a‘o(E;) on U and is 
C" if o(E,),..., o(E,) are. Hence the condition just given implies o(X) is of 
class C” on a collection of open sets covering W and so on W itself. The 
converse is obvious. 

Note that if E,,..., E, is a field of (C”) frames on an open set U c M, 
then the dual basis at each point of U defines a field of dual bases w!, ..., œ" 
on U satisfying w'(E,) = ô. We call this a field of coframes—coordinate 
coframes if E,,..., E„ are coordinate frames. The w!,..., œ" are of class C” 
by the criterion just stated, and covector field ø is of class C” if and only if for 
each coordinate neighborhood U,@ the components of o relative to the 
coordinate coframes are functions of class C’ on U. 


(1.3) Remark It is important to note that a C’-covector field o defines a 
map c: ¥(M)— C'(M) which is not only R-linear but even C’(M)-linear! 
More precisely, if f, ge C'(M) and X and Y are vector fields on M, then 


a(fX + gY) = fo(X) + go(Y), 


for these functions are equal at each pe M (as the reader should verify). 


(1.4) Example If fis a C” function on M, then it defines a C?-covector 
field, which we shall denote df, by the formula 


(X,,4,> =X,f r af (Xp) = Xp f. 


For a vector field X on M this gives df (X) = Xf, a C” function on M. This 
covector field df is called the differential of f and df,, its value at p, the 
differential of f at p. In the case of an open set U c R", we verify that it 
coincides with the usual notion of differential of a function in advanced 
calculus, and, in fact, makes it more precise. In this case the coordinates x! of 
a point of U are functions on U and, by our definition, dx’ assigns to each 
vector X at pe U a number X, x’, its ith component in the natural basis of 
R". In particular ¢6/dx’, dx) = dx'/dx/ = ô$ so we see that dx',..., dx” is 
exactly the field of coframes dual to ô/ôx', ..., /dx". Now if f is a C” 
function on U, then we may express df as a linear combination of 
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dx',..., dx". We know that the coefficients in this combination, that is the 
components of df, are given by df (@/dx') = ôf/ðx". Thus we have 
of. ef 
==, veep dt 
T= 5 le is 


Suppose ae U and X,€ T,(R”). Then X, has components, say, ht, ..., R" 
and geometrically X, is the vector from a to a + h. We have 


atx.) = xs- (zri) E), 


in particular, dx'(X,) = h’, that is, dx' measures the change in the ith coor- 
dinate of a point as it moves from the initial to the terminal point of X,. 
The preceding formula may thus be written 


df (X.a) = (35), dx'(X,) + + (5), dx”(X a). 


This gives us a very good definition of the differential of a function f on 
U c R": dfis a field of linear functions which at any point a of the domain of 
f assigns to each vector X, a number. Interpreting X, as the displacement of 
the n independent variables from a, i.e., a as initial point anda + h as terminal 
point, df (X ,) approximates (linearly) the change in f between these points. 
[Compare this with our earlier discussion in Section II.1 of differentiability of 
a function; the expression above has meaning even if f is not C™, in fact 
exactly when f is differentiable in the (weak) sense of Section II.1.] 


Covector Fields and Mappings 


We shall give further examples of covector fields presently. First, 
however, we must study what happens when we map one manifold to an- 
other. Let F: M > N be a smooth mapping and suppose p € M. Then, as we 
know, there is induced a linear map F,: T,(M) > Trp (N). As we have 
pointed out in (i) at the beginning of this section, F, determines a linear map 
F*: T#,(N) > T#(M), given by the formula 


(1.5) F* (orp )(Xp) T arip (F(X p))- 


In general, F, does not map vector fields on M to vector fields on N. It is 
surprising, then, that given any C’-covector field on N, F * determines 
(uniquely) a covector field of the same class C" on M by this formula. We 
state this as a theorem. 
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(1.6) Theorem Let F: M > N be C® and let o be a covector field of class 
C on N. Then formula (1.5) defines a C’-covector field on M. 


Proof Ifo is the covector field on N, then for any p e M, there is exactly 
one image point F(p) by definition of mapping. It is thus clear that F*(c) is 
defined uniquely at each point of M. Now suppose that for a point pọ € M we 
take coordinate neighborhoods U, œ of pọ and V, y of F(po) so chosen that 
F(U) = V. If we denote the coordinate on U by (x!,..., x”) and those on V 
by (y’,..., y"), then we may suppose the mapping F to be given in local 
coordinates by 

y = f?(xt,..., x")  i=1,...,n. 


Let the expression for o on V in the local coframes be written at q € V as 


Oo, = È aila); > 


i=1 
where ÕL, -.., @% is the basis of T¥(N) dual to the coordinate frames. The 
functions «'(q) are of class C” on V by hypothesis. Using the formula defining 
F*, we see that if p is any point on U and q = F(p) its image, then 
(F*(2)) (Ein) = orio (F (Ein) = X %:(F(P) Or (Fe (Ejp))- 


However, we have previously in Theorem IV.1.6 obtained the formula 


F,(Ejp) = Daa kF(p) > j=1,...,m, 


the derivatives being evaluated at (x*(p),...,x"(p)) = o(p). Using 


~ 


@(E;) = ði, we obtain 
(EHDE) = LadFte)(55) 


As p varies over U these expressions give the components of F*(c) rela- 
tive to w',..., @” on U, the coframes dual to E,,..., Em. They are clearly of 
class C” at least, and this completes the proof. 


The formulas are themselves of some interest and may be used for 
computation, so we shall display them in a corollary. 


(1.7) Corollary Using the notation above, let o = X7- 4;@' on V and 


F*(o) = )'7_, bja on U, where a; and p, are functions on V and U, respec- 
tively, and &', œ” are the coordinate coframes. Then 


F*(@') = ye and i> Late i=1,...,n, 
j=l,...,m. 
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The first formulas give the relation of the bases; the second those of the 
components. If we apply this directly to a map of an open subset of R" into 
R", these give for F*(dy') the formula 


m 


i oy j . 
(1.8) F*(dy') = 2 5x i=1,...,n. 


(1.9) Remark Let U,¢ be a coordinate neighborhood on M, dim M = n, 
and U’ = g(U) = R". Then dx’,...,dx" on U’ are dual to 8/0x',..., 0/0x", 
and we have ~; '(@/dx') = E; so that 9,(E,) = 6/0x', i = 1,...,n. This gives 
for o*(dx') 


<E;, o*(dx')> = <o,(E)), dx!) = òf 
This shows that the coordinate coframes œ!,..., œ" dual to E,,...,£, are 


defined by w/ = o*(dx/), j =1,...,n. 


There is a potential source of confusion in notation here. The coordin- 
ates x!, ..., x" can be considered as functions on U and as such have differen- 
tials dx' defined by 


ix, dx) = Xx, 
the ith component of X in the coordinate frames. In particular <E}, dx‘ = 
E,x' = 64, so that dx!, ..., dx" are dual to E,,..., E„ and therefore dx’ = w, 
i= 1,...,n. Combining this with the formula above gives dx' = ọ*(dx'), 


which is nonsense unless we are careful to distinguish x' as (coordinate) 
function on U c M, on the left, from x! as (coordinate) function on (U) = 
U’ c R", on the right (cf. Remark III.3.2). 


(1.10) Example We may apply Theorem 1.6 to obtain examples of covec- 
tor fields on a submanifold M of a manifold N. Let i: M — N be the inclu- 
sion map and suppose ø is a covector field on N. Then i*(a) is a covector 
field on M called the restriction of o to M. It is often denoted oy or simply v. 
Recalling that for each pe M, T,(M) is identified with a subspace of T,(N) by 
the isomorphism i, , we have for X,€ T,(M) 


ou(X>p) = (i*o)(Xp) = a(i4(X»)) = (Xp). 


The last equality is the identification. 

As an example, let M c R", and let o be a covector field on R", for 
example take o = dx’. Then ø restricts to a covector field oy on M. Note 
that in this example dx! is never zero as a covector field on R", but on M it is 
zero at any point p at which the tangent hyperplane T,(M) is orthogonal to 
the x!-axis. 
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Exercises 


1. Verify properties (i)}-(iii) of V, V*, and V**. 

Let G = Gi(n, R) and define n? covector fields ;;, 1 < i,j < n, on G by 
Oi; = È k=1 Vn 4X,;, where Y = (y;;) is the inverse of X = (x,,). Show 
that these forms are invariant under L,: G > G, left translation by A. 
Further show that {o;;} is a field of frames on G. 

3. Letfi,...,f,,r < n, be C” functions on an open set U of a manifold M. 
Prove that there are coordinates V, w in a neighborhood of pe U such 
that f\,...,f, are among the coordinate functions if and only if 
df,,..., df, are linearly independent at p. 

4. Let F:M-—+N be a C” mapping such that F,:7,(M) -> Tr (N) 
has kernel = (0). If V, Y is a coordinate neighborhood of F(p), show that 
after suitable renumbering of coordinates y', the mapping q— 
(y'(F(q)),..-, Y"(F(q)), the first m coordinates of F(q), m = dim M, 
determines a coordinate map on a neighborhood U of p. 

5. Determine the subset of R? on which o! = x! dx! + x? dx? and 
o? = x? dx! + x! dx? are linearly independent and find a frame field 
dual to o', o? over this set. 

6. Show that the restriction of ø = x! dx? — x? dx! + x3 dx* — x* dx? of 
R* to the sphere S° is never zero on S°. 

7. Show that the set 7 '(M) of all covector fields on M, like the set ¥(M) of 
all vector fields on M is a C®(M) module. Prove also that o e 7 '(M) if 
and only if ø is a C®(M)-linear mapping from ¥(M) to C”(M). 

8. Try to determine a C” manifold structure on T*(M) = |]Jpem TR(M) in 
such a fashion that a covector field o on M is a C” mapping from M into 
T*(M) and so that the natural mapping z taking each o,€ T*(M) to p 
is C” 


2 Bilinear Forms. The Riemannian Metric 


In the case of a vector space V over R a bilinear form on V is defined to be 
a map ®: V x V —> R that is linear in each variable separately, i.e., for 
a, PeR and v, vi, ¥2, W, Wi, WE V, 


O(av, + By, , w) = aP(v,, w) + BO(v, , w), 

O(v, cw, + Bw2) = a®(v, w,) + BO(v, w2). 
A similar definition may be made for a map ® of a pair of vector spaces 
V x Wover R. We will not pursue this generalization at the moment except 


to point out that the map assigning to each pair ve V, ce V* a number 
<v, o>, as discussed in the preceding section, is an example. 


PDF pdfFactory Pro www.fineprint.cn 


184 V TENSORS AND TENSOR FIELDS ON MANIFOLDS 


Bilinear forms on V are completely determined by their n? values on a 


basis e,,...,e, of V. If a; = O(e,e), 1<ij<n, are given and if 
v = J Jie;, w = > we, are any pair of vectors in V, then bilinearity requires 
that 


O(v,w) = } agiw. 
i, j=l 

Conversely, given the n x n matrix A = (@,,) of real numbers, the for- 
mula just given determines a bilinear form ®. Thus there is a one-to-one 
correspondence between n x n matrices and bilinear forms on V once a 
basis e}, ..., € is chosen. The numbers «,; are called the components of ® 
relative to the basis. 

We will mention some special cases which will be of interest to us. A 
bilinear form, or function, is called symmetric if ®(v, w) = ®(w, v), and skew- 


symmetric if ®(v, w) = —©(w, v). It is easily seen that regardless of the basis 
chosen, these correspond, respectively, to symmetric, '4 = 4, and to skew- 
symmetric, ‘A = — A, matrices of components. 


A symmetric form is called positive definite if ®(v, v) > 0 and if equality 
holds if and only if v = 0; in this case we often call ® an inner product on V. 
We shall be particularly interested in this case in the succeeding chapters; a 
vector space with an inner product is called a Euclidean vector space since ® 
allows us to define the length of a vector, ||v|| = (®(v, v))'?, and the angle 
between vectors, as was remarked in Section 1.1. 


(2.1) Definition A field ® of C’-bilinear forms, r > 0, on a manifold M 
consists of a function assigning to each point p of M a bilinear form ®, on 
T,(M), that is, a bilinear mapping ®,: T,(M) x T,(M) > R, such that for 
any coordinate neighborhood U, ø the functions «,; = O(E;, E;), defined by 
® and the coordinate frames E,,..., Ep, are of class C". Unless otherwise 
stated bilinear forms will be C”. [To simplify notation we usually write 
P(X p, Y,) for P(X p» Y,)-] 


The n? functions œ;; = ®(E;, E;) on U are called the components of ® in 
the coordinate neighborhood U, g. Properties similar to those of covectors 
hold in this case also. As in (1.2) if ® is a function assigning to each pe M a 
bilinear form, then © is of class C” if and only if for every pair of vector fields 
X,Y on an open set U of M, the function ®(X, Y) is C on U. As in 
Remark 1.3 we have the fact that ® is C*(U)-bilinear as well as R-bilinear: 
fe C”(U) implies ®(fX, Y) = fO(X, Y) = A(X, fY) (Exercise 2). 

Suppose F,: W —> V is a linear map of vector spaces and © is a bilinear 
form on V. Then the formula 


(2.2) (F*)(v, w) = O(F,(¥), F.(w)) 


defines a bilinear form F*® on W. We have the following properties: 
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(i) If ® is symmetric (skew-symmetric), then F*® is symmetric 
(skew-symmetric). 

(ii) If ® is symmetric, positive definite, and F, is injective, then F*® is 
symmetric, positive definite. 


In particular, this latter applies to the identity map i, of a subspace W 
into V. In this case i*® is just restriction of ® to W: 
(i*@)(v, w) = O(i,¥, i,w) = (v, w). 

Now let F: M > N be a C” map and suppose that ® is a field of bilinear 
forms on N. Then just as in the case of covectors this defines a field of 
bilinear forms F*® on M by the formula for (F*®), at every pe M: 


(F*®)(X, > Y,) = P(F,(X p) F,(Y,)). 


We state this in the form of a theorem. 


(2.3) Theorem Let F: M >N be a C” map and ® a bilinear form of class 
C" on N. Then F*® is a C’-bilinear form on M. If © is symmetric (skew- 
symmetric), then F*® is symmetric (skew-symmetric). 


Proof The proof parallels those of Theorem 1.6 and Corollary 1.7 and 
we analogously obtain formulas for the components of F*® in terms of those 
of D. We suppose U, and V, y are coordinate neighborhoods of p and of 
F(p) with F(U) c V. Using the notation of Theorem 1.6 and Corollary 1.7 
we may write palp) = (F*®),(E;,, Ejp) = P(F,(Eip), F4(Ejp))- Applying 
Theorem IV.1.6 as before, we have 

n dysé Š = 
Palp) = > mee D(E sep) , Erp): 


s.t=1 


This gives the formula 


oy’ dy! 
QA B= D aalo) Sis <m 


for the matrix of components (f;,) of F*® at p in terms of the matrix («,,) of 
® at F(p). The functions £;; thus defined are of class C” at least on U which 
completes the proof, except for the statements about symmetry and skew- 
symmetry which are obvious consequences of (i) above. i 


(2.5) Corollary If F is an immersion and © is a positive definite, symmetric 
form, then F*® is a positive definite, symmetric bilinear form. 


Proof All that we need to check is that F*@® is positive definite at each 
peM. Let X, be any vector tangent to M at p. Then F*®(X,, X,) = 
O(F,(X,), F,(X,)) = 0 with equality holding only if F,(X,,) = 0. However, 
since F is an immersion, F,(X,) = 0 if and only if X, = 0. i 
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(2.6) Definition A manifold M on which there is defined a field of sym- 
metric, positive definite, bilinear forms ® is called a Riemannian manifold and 
® the Riemannian metric. We shall assume always that ® is of class C™. 


The simplest example is R” with its natural inner product 


,(X,, Yap) = } œp! where yeaa and =S 
i=1 Ox Ox 

At each point we have ®(0/éx', 6/0x/) = ô; so that the matrix of components 

of ®, relative to the standard basis, is constant and equals 7, the identity 

matrix. It follows that ® is C®. 

Corollary 2.5 enables us to give many further examples. Any imbedded 
or immersed submanifold M of R” is endowed with a Riemannian metric 
from R" by virtue of the imbedding (or immersion) F: M > R”. Thus, for 
example, a surface M in R? has a Riemannian metric. The idea of the 
corollary in this case is very simple: If i: M > R? is the identity and X,, Y, 
are tangent vectors to M at p, then i*@(X,, Y,) = P(i, Xp, ix Yp) = 
D(X, Y,), that is, we simply take the value of the form on X,, Y, considered 
as vectors in R°, using our standard identification of T,(M) with a subspace 
of T,(R*). In particular S?, the unit sphere of R?, has a Riemannian metric 
induced by the standard inner product in R°. If X,, Y, are tangent to S? at p, 
then ©(X,,, Y,) is just their inner product in R°. 

Classical differential geometry deals with properties of surfaces in 
Euclidean space. The inner product ® on the tangent space at each point of 
the surface, inherited from Euclidean space, is an essential element in the 
study of the geometry of the surface. It is known as the first fundamental form 
of the surface. 

We terminate with a few remarks about bilinear forms on an n- 
dimensional vector space V. We continue the numbering from properties (i) 
and (ii) which precede the discussion of mappings. We define the rank of a 
form ® on V to be the codimension of the subspace W = {ve V | O(v, w) = 
0V we M, that is, rank ® = dim V — dim W. This concept is elaborated in 
the exercises. The following facts are often useful: 


(iii) If ® is a bilinear form on V, then the linear mapping gy: V > V* 
defined by <w, g(v)> = ®(w, v) is an isomorphism onto if and only if 
rank ® = dim V. 

(iv) Every bilinear form ® may be written uniquely as the sum of a 
symmetric and a skew-symmetric bilinear form, namely, 


D(v, w) = $[O(v. w) + B(w, v)] + HO, w) — Ow, v)] 


(v) Ifa skew-symmetric form ® has a rank equal to dim V, then dim V 
is an even number. 
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Exercises 
1. Verify (i}(v). 
2. Show that in the C” case Definition 2.1 is equivalent to the following: A 
field of C® bilinear forms on M is a C~(M)-bilinear mapping D: ¥(M) x 


X(M) > C*(M). 

3. Show that the rank of the matrix of components (®(e;,e,)) of a bilinear 
form ® on V” is independent of the basis e,,...,e, and is equal to the 
rank of Ọ. 


4. Let © be a symmetric, positive definite bilinear form on V. Show that (a) 
rank ® = dim V, (b) ® determines a natural isomorphism of V and V*, 
and (c) ®(v, w)? > O(v, v)P(w, w). How does (c) allow us to define to 
within z the angle of two vectors? 

5. Show that the set A(V) of all bilinear forms on a vector space V is a 
vector space and determine its dimension. Prove that a linear map F,: 
V— W of vectors spaces determines a linear map F*: B(W) > B(V) with 
(Gy ° Fẹ)* = F* o G* for composed mappings. 

6. Continuing Exercise 5, let Gi(V) denote the group of (linear) automorph- 
isms of V. Show that GV) acts on @(V). Choosing a basis of V 
determine the action explicitly. 

7. Show that the set of all elements of Gi(V) leaving ® fixed under the action 
of Exercise 6 form a closed subgroup. Is it a regular submanifold of 
GV)? 

8. Let ® be a C” field of bilinear forms and X a vector field on a manifold 
M. Using the one-parameter group action 0, on M and the induced 
mapping 0*on ®, define a “Lie derivative” Ly® of ® with respect to X. 

9. Show that (4, B) = tr ‘AB, the trace of the transpose of A times B, 
defines a symmetric bilinear form on M,(R), the n x n matrices over R. 
Is it positive definite? 


3 Riemannian Manifolds as Metric Spaces 


The importance of the Riemannian manifold derives from the fact that it 
makes the tangent space at each point into a Euclidean space, with inner 
product defined by ®(X,, Y,) (= ®,(X,, ¥,)). This enables us to define 
angles between curves, that is, the angle between their tangent vectors X, 
and Y, at their point of intersection, and lengths of curves on M, as we shall 
see. Thus we may study many questions concerning the geometry of these 
manifolds; this is a large part of the classical differential geometry of surfaces 
in R°. 

As an example we consider the question of defining the length of a curve. 
Let t > p(t), a < t < b, bea curve of class C! ona Riemannian manifold M. 
Then its length L is defined to be the value of the integral 


b dp dp\\"!? 
E ® A 
L=) (ofwa) 4 
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We make several comments here: first, the integrand is a function of t 
alone, so a more precise notation is to denote its value at each t by ®,,,(dp/dt, 
dp/dt), where dp/dt € T,,(M) denotes the tangent vector to the curve at 
p(t). This function is continuous by the continuity of dp/dt and ®. Secondly, 
the value of the integral is independent of the parametrization. In 
Equation IV.(3.7), we gave the following formula for change of parameter: 
dp/ds = (dp/dt)(dt/ds), where t = f(s), c < s < d is the new parametrization. 


Thus 
g dp dp\\'!? Ha dp dp\ (dt\?\'? ds 
I Geta) s=) (a) i 


dp dp 
= ® dt. 
| ‘( (r, dt w) 
In particular, we note that the arclength along the curve from p(a) to p(t), 
which we may denote by s = L(t), gives a new parameter by the formula 


: dp dp\\' 
s= 1) =| (o2. 7) dt 
which implies 


ds dp dp\\'!? ds\? dp dp 
rae 7) CC 
Within a single coordinate neighborhood U, g with coordinate frames 
Eip- Enp, we have ®(E,,,,E;,) = fn A where o(p) = x = (x!,..., x"); 


np? tp>)"“jp 


and the curve is given by g(p(t)) = (x'(¢),..., x"(t)), so that L(t) becomes 
dx’ dx 
s= =) fat a dt a) 


This leads to the frequently used abbreviation 


= 2 Gi (x) dx! dx! 


for the Riemannian metric in local coordinates. [This formula can be in- 
terpreted (later) in terms of multiplication of tensors.] 

We note that in the case of a curve in R” (with its standard inner pro- 
duct), say p(t) = (x'(t), ..., x"(t)) with a < t < b, then we have the familiar 
formula for arc length 


7 if (se) r 


We have used ®(6/0x', d/0x/) = ô; and dp/dt = )%_, x'(t) d/0x' in our 
definition to obtain this. 
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Of course, it is not necessary to assume the curve of class C!; weaker 
assumptions will do. In particular, we may suppose it is piecewise of class 
C', which we will denote D!. We will prove the following rather useful 
theorem concerning Riemannian manifolds: 


(3.1) Theorem A connected Riemannian manifold is a metric space with the 
metric d(p, q) = infimum of the lengths of curves of class D' from p to q. Its 
metric space topology and manifold topology agree. 


Proof Since M is arcwise connected, d(p, q) is defined; and from the 
definition it is immediate that d(p, q) is symmetric and nonnegative. It is also 
very easy to check that the triangle inequality is satisfied if we use the fact 
that a curve from p, to p, and a curve from p, to p, may be joined to give a 
curve from p, to p, whose length is the sum of the lengths of the two curves 
which are thus joined. It remains to show that d(p, q) = 0 implies p = q. 

In order to complete the proof we obtain some inequalities. In all that 
follows let p be an arbitrary point of M, U, ọ a coordinate neighborhood 
which has the property that g(p) = (0, ..., 0), and a > Oa fixed real number 
with the property that g(U) > B,(0), me closure of the open ball of radius a 
and center at the origin of R”. We let x!, ..., x" denote the local coordinates 
and g;(x) the components of the metric tensor ® as functions of these 
coordinates. Since these n? functions are C” in their dependence on the 
coordinates and are the coefficients of a positive definite, symmetric matrix 
for each value of x in g(U), then on the compact subset K,, r < a, of R" x R" 
defined by K, = {(x, «| {|x|] < r, |lo|] = 1}, the expression È$ j=; gux) xi"? 
assumes a maximum value M, and a minimum value m, > 0. In fact if m, M 
denote the minimum and maximum corresponding to r = a, we have the 
inequalities 


n 1/2 

O<m<ms< ( >, aul! <M, <M. 
i, j=l 

Moreover, if (B', ..., p") are any n real numbers such that (7, (')?)'? = 

b #0, then replacing each at above by f'/b and multiplying the inequalities 

by b yields: 


n 1/2 
O<mbsmb<(¥ ab “6 < M,b < Mb 
j= 


for every xe B,(0). Now we shall make the assumption that if x, y are any 
points of R” with its standard Riemannian metric (as defined above), then 
the infimum of the lengths of all D! curves in R” from x to y is exactly the 
length of the line segment xy, in other words, it is |y — x|| the Euclidean 
distance from x to y as defined in Section I.1 (see Exercises 5 and 6 for the 
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method of proof). Let p(t), a < t < b, be a D' curve lying in ~ '(B,(0)) = U 
which runs from p = p(a) to q = p(b) and let 
b n 1/2 
L-i | È aeoo] a 
“a i j=1 
denote its length. The last set of inequalities above and the assumption on R” 
imply that for p # q 


0 < mlg(q)| < m, lola) < L < M, | | 


“a 


5 n ; 1/2 
<M| JÈ «| dt. 
“a =1 


i 


We first use these inequalities to complete the proof that d(p, q) is a 
metric on M. Let q' be any point of M distinct from p. Then for some r, 
0 <r<a,q lies outside of p~ '(B,(0)) = U. Let p(t), 0 < t < c, bea curve 
of class D! which goes from p = p(0) to q’ = p(c) and let L be its length. 
There is a first point q = p(b) on the curve which is outside ep *(B,(0)), 
that is, such that p(t) lies inside the neighborhood 7 '(B,(0)) for 0 < t < b, 
but q = p(b) does not; q is the first point of the curve with lo(q)| = r. 
If L denotes the length of the curve p(t), O < t < b, then L < Ľ. From this 
it follows that Ľ > L > mr and, since the curve was arbitrarily chosen, that 
d(p, q) > mr. This means that if q' # p, then d(p, q') #0, so that d(p, q) is 
a metric as claimed. 

In order to show the equivalence of the metric and the manifold topolo- 
gies on M, it is enough to compare the neighborhood systems at an arbitrary 
point p of M; in fact for the manifold topology we need only consider the 
neighborhoods lying inside a single coordinate neighborhood U, ¢ as above. 
Thus we must show that each neighborhood V, = y~ ‘(B,(0)) = U of the 
point p contains an ¢-ball, S,(p) = {q € M|d(p, q) < £}, of the metric topology, 
and conversely. This will follow from the inequalities we have obtained. In the 
last set of inequalities, if r<a and geg” 1(B,(0)), then ||g(q)|| is the 
Euclidean distance d,(p, q) of p from q, i.e., the distance in R” of (p) = 0 from 
(q). Thus V, = {qe U|d,(p,q) < r}. The left side of the inequality can be 
written as 


0 < md,(p, q) < m,d,(p, q) < L. 
Since d(p, q) is the infimum of such L, we see that V, c S,(p) for r < e/m. 
On the other hand, given V,, r < a, and a D' curve in V, = o~ '(B,(0)), as 
described above, the integral on the right-hand side of the inequality is its 


Euclidean length Lg, i.e., the length of g(p(t)), a < t < b. Thus the right-hand 
side of the inequality may be written as 


L<M,Ly < MLg. 
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According to Exercises 5 and 6, d,(p, q) is the infimum of the lengths of such 
curves from p to q, just as d(p, q) is the infimum of L for such curves. Hence, 
Sp) < V, if e < Mr. Thus the S,(p) and V, form equivalent neighborhood 
systems at p. This completes the proof of the theorem except for the unproved 
assertion about R” (essentially this theorem itself in R”), which is left to the 
exercises. | 


As we have mentioned, the existence of a Riemannian metric on a mani- 
fold provides an important ingredient to the study of manifolds from a 
geometric point of view, allowing us to introduce on such spaces many 
concepts of Euclidean geometry such as distances, angles between curves, 
areas, volumes, and—less obviously—straight lines, or geodesics. For one 
way of characterizing a straight line in Euclidean space is that the length of 
any segment pg on it is exactly the distance d(p, q) between its end points— 
which implies that it is also the shortest curve between any two of its points 
(by Exercises 5 and 6 again). We can, using the metric just introduced, ask 
whether there exist curves on a Riemannian manifold which have this 
property. The answer, with some qualifications, is yes; and the class of 
curves (geodesics) thus isolated has both similarities to and fascinating dif- 
ferences from straight lines in Euclidean geometry. For example, if S? is the 
unit sphere in R°, with the induced Riemannian metric, then great circles are 
the geodesics: they indeed realize the distance (and are the shortest curves) 
between any two of their points which lie on the same semicircle. Note that 
these geodesics are closed curves in marked contrast to straight lines in 
Euclidean space. 

Two Riemannian manifolds M, and M, (with Riemannian metrics ®, 
and ®,) are said to be isometric if there exists a diffeomorphism 
F:M,-—M), such that F*®, = ®,. Clearly such an isometry is also an 
isometry of M, and M, as metric spaces, that is, d, (F(p), F(q)) = d,(p, q) in 
the metrics defined above. It is true, but not easy to prove, that a converse to 
this statement holds (see Kobayashi and Nomizu [1, Theorem 3.10, p. 169]). 

The geometry, including geodesics, lengths of curves, areas, and so forth, 
depends very much on the Riemannian metric of M. For example, the sphere 
S? as an abstract manifold is diffeomorphic to many surfaces in R*, of which 
the unit sphere is only one possibility, another being a standard ellipsoid E 
in R°: 


xe R? 


E= + 


(ey (x*)? (x?) 
a? i b? c =l 


Here the geodesics of the induced Riemannian metric, unlike great circles on 
the unit sphere S*, are not closed curves in general (see Hilbert and Cohn- 
Vossen [1, p. 222-4]). Thus E and S? are diffeomorphic but not isometric, 
that is, they are equivalent as differentiable manifolds but not as Riemannian 
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manifolds. An important question is to decide whether or not two given 
Riemannian manifolds are, in fact, isometric; and if so, in how many ways. 
One of the questions which led Gauss to some of his great discoveries may 
have been a very practical one: Is there any isometry possible between a 
portion of the surface of a sphere (the earth) with the metric mentioned above 
and a portion of the Euclidean plane with its standard metric? Or equiva- 
lently, can we construct a map of some part of the earth’s surface which does 
not distort distances and/or angles? We shall come back to this question in a 
later chapter. 


Exercises 


1. Using spherical coordinates (6, g) on the unit sphere p = 1 in R?, deter- 
mine the components (g;;) of the Riemannian metric on the domain of 
the coordinates (U = S? minus the north and south poles). 

2. Similarly, find g,; for T? = S' x S' using coordinates (0, œ) and the 
imbedding 


(0, p) > ((a + b cos p) cos 8, (a + b cos ø) sin 9, b sin g) 


in R? given by rotating a circle of radius b, center at (a, 0, 0), a > b, 
around the x?-axis. 

3. Show that to each vector field X on a Riemannian manifold there corre- 
sponds a uniquely determined covector field o, by (iii) of the previous 
section. Show that this is actually an R-linear map. Is it C°(M)-linear? 
Use your arguments to show that if f e C*(M), then there is a unique C® 
vector field, grad f, which corresponds via the metric to o = df. 

4. Using the results of Exercise 8 of Section 2, show that 0, is a 1-parameter 
group of isometries of a Riemannian manifold M if and only if the Lie 
derivative of the Riemannian metric ® with respect to the infinitesimal 
generator of 6, is zero, Ly ® = 0. 

5. Let x(t), 0 < t < 1, be a curve of class D’ in R” from x(0) = (0, 0, ..., 0) 
to x(1) = (a',..., a"). Assume, for simplicity, that ||x(t)| > 0 for t > 0 
and write x(t) = A(t)u(t), where A(t) = ||x(t)|| and u(t) is a unit vector. 
Show that ||x(¢)||? = (A(t)? + (A(d))?|/a(¢)||?, and use this to prove that 
the length of the curve is at least ||x(1) — x(0)||, the distance from the 
origin to a = (a’,..., a"). 

6. Show that the simplifying assumption that ||x(t)| > 0 for t>0 in 
Exercise 5 may be removed by considering only the portion of the curve 
outside a small sphere around the origin, whose radius we then let tend 
to zero. Use these results to establish that in R” the infimum of the length 
of curves of class D! joining two points x and y is ||x — y|| so that the 
metric defined on R” by Theorem 3.1 and the standard Riemannian 
structure is the usual one. 
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7. Use the Riemannian metric to define the cosine of the angle between 
intersecting curves. Apply this to the coordinate curves in Exercises 1 
and 2. 


4 Partitions of Unity 


We have mentioned, but not proved, that there is no nonvanishing 
C®-vector field on S?. It follows from the exercises at the end of the last 
section that the same is true for covector fields on S°. In view of this non- 
existence, it might occur to ask whether on an arbitrary manifold M it is 
possible to define a C” positive definite, bilinear form, that is, is every 
manifold Riemannian? This question and a number of others may be 
answered using the notion of a partition of unity. Before discussing this 
concept we need some preliminary definitions and lemmas. 

A covering {A,} of a manifold M by subsets is said to be locally finite if 
each pe M has a neighborhood U which intersects only a finite number of 
sets A,. If {A,} and {Bp} are coverings of M, then {B,} is called a refinement of 
{A,} if each Bẹ < A, for some a. In these definitions we do not suppose the 
sets to be open. Any manifold M is locally compact since it is locally 
Euclidean; it is also o-compact, which means that it is the union of a count- 
able number of compact sets. This follows from the local compactness and 
the existence of a countable basis P,, P2, ... such that each P; is compact. A 
space with the property that every open covering has a locally finite 
refinement is called paracompact; it is a standard result of general topology 
that a locally compact Hausdorff space with a countable basis is paracom- 
pact. We will prove a version of this adapted to our needs. 


(4.1) Lemma Let {A,} be any covering of a manifold M of dimension n by 
open sets. Then there exists a countable, locally finite refinement {U;,, ~;} 
consisting of coordinate neighborhoods with ọ{U;) = B3(O) for all i= 1, 
2, 3,...and such that V, = o7 1(B"(0)) = U; also cover M. 


Proof Since the lemma is trivial otherwise, we assume that M is not 
compact. We begin with the countable basis of open sets {P;}, P; compact, 
which we mentioned above. Define a sequence of compact sets K,, K2,... as 
follows: K, = P, and, assuming K,,... K; defined, let r be the first integer 
such that r >i and K; c | )5_,P;. Define K;,, by 


Kyi = P UP, UU P, = Pu UP, 


Denote by K;,, the interior of K;,,;it contains K,. Let K_, = Ko = Ø 
and for each i = 0, 1,2,...,we consider the open set (K2 — K;-1) 0 Ay. 
Around each p in this set choose a coordinate neighborhood U,, ,, @p,a lying 
inside the set and such that 9, ,(p)=0 and 9, (U,.,) = B3(0). Take 
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Via = Pp, 1(B"(0)) and note that these are also interior to (Kiso - 
K;_,) 0 A,. Moreover allowing p, « to vary, a finite number of the collection 
of V, , covers Kj, — K,, a closed compact set. Denote these by V, , with k 
labeling the sets in this finite collection. For each i = 1, 2,... the index k takes 
on just a finite number of values; thus the collection V; , is denumerable. We 
renumber them as V, V,,...,and denote by Uy, @ U2, @2,...,the 
corresponding coordinate neighborhoods containing them. These satisfy the 
requirements of the conclusion; in fact for each pe M there is an index i such 
that pe K; but from the definition of U,, V, it is clear that only a finite number 
of these neighborhoods meet K;. Therefore {U,} and also {V} are locally 
finite coverings refining the covering {A,}. 


(4.2) Remark It is clear that it would be possible to replace the spherical 
neighborhoods B"(0) by cubical neighborhoods C;(0) in the lemma. 


We shall call the refinement U;, V;, g; obtained in this lemma a regular 
covering by spherical (or, when appropriate, cubical) coordinate neighbor- 
hoods subordinate to the open covering {4,}. 

Recall that the support of a function f on a manifold M is the set 
supp(f) = {x € M | f (x) # 0}, the closure of the set on which fdoes not vanish. 


(4.3) Definition A C” partition of unity on M is a collection of C”? func- 
tions { f,} defined on M with the following properties: 


{ 
(1) f, => OonM, 
(2) {supp(f,)} form a locally finite covering of M, and 
(3) ¥&, f,(x) = 1 for every xe M. 

Note that by virtue of (2) the sum is a well-defined C™ function on M 
since each point has a neighborhood on which only a finite number of the f,’s 
are different from zero. A partition of unity is said to be subordinate to an 
open covering {4,} of M if for each y, there is an A, such that supp(f;) = 4,. 


(4.4) Theorem Associated to each regular covering {U;, Vi, P} of M there 
is a partition of unity { f such that f, > 0 on V, = ør *(B,(0)) and supp f; = 
o; ‘(B2(0)). In particular, every open covering {A,} has a partition of unity 
which is subordinate to it. 


Proof Exactly as in part (a) of the proof of Theorem II.5.1, we see that 
there is, for each i, a nonnegative C® function g(x) on R” which is identically 
one on B"(0) and zero outside B3(0). Clearly g;, defined by g; = g° g,on U; 
and g; = 0 on M — U,, is C” on M. It has its support in ø; *(B3(0)), is +1 
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on V;, and is never negative. From these facts and the fact that {Vj} is a 
locally finite covering of M we see that 


Go, Na 
Èg, 
are functions with the desired properties. I 


Some Applications of the Partition of Unity 


We shall give several applications which illustrate the utility of this 
concept. The first answers the question raised at the beginning of the first 
paragraph in this section. 


(4.5) Theorem It is possible to define a C? Riemannian metric on every C” 
manifold. 


Proof Let {U;, V,, g;} be a regular covering of M and f; an associated 
C°” partition of unity as defined above. Then g;: U; > B3(0) being a diffeo- 
morphism, the bilinear form ®; = ø*¥¥, Y the usual Euclidean inner product 
on R", defines a Riemannian metric on U; (Corollary 2.5). Since f; > 0 on 
V,, f;®, is a Riemannian metric tensor on V,, is symmetric on U;, and is zero 
outside ø; '(B3(0)). Hence it may be extended to a C*-symmetric bilinear 
form on all of M which vanishes outside g; '(B3(0)) but is positive definite 
at every point of V,. It is easy to check that the sum of symmetric forms is 
symmetric, therefore ® = $. f,®,, defined precisely by 


OX, Y) = E APOL, Yh peM, 


is symmetric. We have denoted by f; ®; its extension to all of M, and we must 
remember that the summation makes sense since in a neighborhood of each 
pe M all but a finite number of terms are zero. However, ® is also positive 
definite. For every i, f; > 0 and each pe M is contained in at least one V;. 
Then f;(p) > 0, and therefore 


0 = 0,(X,. Xp) = > flp)O{(X, » Xp) 
implies that ®(X,, X,) = 0. This means 
0 = o¥Y(X, ’ Xp) = Y(Qj4(X p) Pil Xp) 
But since ¥ is positive definite and ọ is a diffeomorphism, this implies 


X, = 0. This completes the proof. i 


As a second application we consider the following question. Let M be a 
C™ manifold. Then is M diffeomorphic to a submanifold of Euclidean space 
R“ of some sufficiently high dimension N? This is a rather difficult question, 
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particularly if we modify it slightly so as to leave the choice of N less 
arbitrary. For example, is every surface, that is, every two-dimensional mani- 
fold M, imbeddable as a submanifold of R?? [The answer is no; it is known 
that this is not always possible even if the surface is compact.] We shall give 
a partial answer to the question as first posed. 


(4.6) Theorem Any compact C® manifold M admits a C® imbedding as a 
submanifold of R“ for sufficiently large N. 


Proof We let {U;, V,, pi} be a finite regular covering of M ; there exists 
such because of the compactness. We have defined the associated partition 
of unity {/;} using functions {g;}, where g; = 1 on V;, and we shall use here 
these C® functions {g;} on M rather than the (normalized) { f;}. Let gj: U; > 
B3 (0) be the coordinate map. Then the mapping g; y;: U; > B3(0)—mapping 
pE U; to g(p)p(p) = (gi(p)x*(p), -.., gi(P)x"(p)) in R"—1s a C” map on U;, 
taking everything outside ø; '(B%(0)) onto the origin, but agreeing with 9; 
on V,. It may be extended to a C” mapping of M into B3(0) by letting it map 
all of M — U; onto the origin. When we write g;y;, we will mean this 
extension; on V, it is a diffeomorphism to B} (0) so its Jacobian matrix has 
rank n = dim M there. 

We suppose that i = 1,..., k is the range of indices in our finite regular 
covering and let N = (n + 1)k. Define 

F:M >R >R" x- x R™XRX*° XR 


— amm 
k k 


by 
F(p) = (9; (p)e1(p); --- ; 9e(P)@x(P)3 g1 (P), -- -> 9u(P))- 


Then F is clearly C® on M, and in any local coordinates on M the N x n 
Jacobian of F breaks up into k blocks of size n x n followed by a k x n 
matrix so its rank is at most n. However, p e M implies p € V; for some i, and, 
on V, g; = 1, so gip; = 9; and the matrix has rank n. Thus F: M > R” isa 
C® immersion. If it is one-to-one the proof is finished since M is compact 
and Theorem III.5.7 applies. Suppose F(p) = F(q). Then g,(p) = g,(q), 
i = 1,..., k. This implies that g(p)o,(p) = g:(a)p:(q); but since g,(p) # O for 
some i, this means ~,(p) = 9(q) for that i and since g, is one-to-one, we see 
that p = q. Thus F is one-to-one, completing the proof. i 


We remark that it is an obvious disadvantage of this theorem that N may 
be much larger than we would like it; in fact we have no way of giving an 
effective bound on it from this proof. For example, we know that it takes at 
least two coordinate neighborhoods to cover S? (using stereographic projec- 
tions from the north and south poles) and hence k = 2,n = 2 so that N = 6, 
implying that S? may be imbedded in R®. This is obviously not the best 
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possible! Since a sphere with handles (compare Section I.4) may require 
more than two coordinate neighborhoods to cover it, the value of N would 
increase accordingly. 

Another defect of the theorem is that it only applies to a compact mani- 
fold and although such manifolds are important, it would be very nice to 
know that every manifold may be considered as a submanifold of R” for 
some N. Then our intuitive geometric concepts derived from the classical 
study of curves and surfaces in R? could be seen to carry over to arbitrary 
manifolds; in particular, the concept of tangent space T,(M) is given intui- 
tive content just as in Example IV.1.10. Clearly this question has great inter- 
est in manifold theory. The following theorem was proved by Whitney [1] in 
1936 in a paper which is one of the landmarks in the study of differentiable 
manifolds. It is known as the Whitney imbedding theorem. 


(4.7) Theorem Any differentiable manifold M of dimension n may be imbed- 
ded differentiably as a closed submanifold of R?"*?. 


The proof has since been simplified and appears in many recent texts, for 
example, Milnor [2], Sternberg [1], and Auslander and MacKenzie [1] and 
Hirsch [1]. (Whitney later, by more difficult methods, replaced R?"*! by 
R?") 

Our final example of the way in which the ideas of this section may be 
used will be to prove the following “smoothing” theorem: 


(4.8) Theorem Let M be a C” manifold and A a compact subset of M, 
possibly empty. If g is a continuous function on M which is C® on A and e isa 
positive continuous function on M, then there exists aC® function h on M such 
that g(p) = h(p) for every pe A and | g(p) — h(p)| < elp) on all of M. 


In order to prove this we shall need a similar theorem for the case of a 
closed n-ball in R". For convenience we choose the following one (see 
Dieudonné [1] for a proof). 


(4.9) Lemma (Weierstrass approximation theorem) Let f be a contin- 
uous function on a closed n-ball B" of R” and let € > 0. Then there is a 
polynomial function p on R” such that | f(x) — p(x)| < £ on B". 


Another similar but easier approximation lemma for R” which would 
serve equally well here is given in Section VI.8 (Exercise 2). We now proceed 
with the proof of the theorem. 


Proof By definition “g is C® on A” means that there is a C” extension 
g* of g|A which is C” on an open set W’ containing A. Since g — g* = 0 on 
A, which is compact, we can find an open set W, Ac Wc W’ such that 
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\g(p) — 9*(p)| < (p) for pe W. To do this, first find a finite covering of A by 
neighborhoods W, such that W, c W’ is compact. Then K = || W, = W is 
compact and we let ¢’ = inf,..x e(p) > 0. Finally, let W be the open subset of 
|] W, on which |g — g*| < &’. 

Next take a regular covering {U,, ;} which is a refinement of covering 
{W, M — A} of M. On each U; we define a C® function h; as follows: If 
U; c W, then let h; = g*|U;. Otherwise, we use the Weierstrass approxima- 
tion theorem to obtain a polynomial function p; on R” which approximates 
ĝ =g° Q; ' on B3(0) = g(U,) to within e; = inf e(p) on peg; '(B3(0)), and 
then define h; = p;o g;. Thus on each U; we have defined a C” function h; 
which satisfies |h; — g} < e(p) on o7 +(B3(0)). Now let { fi} be the C” partition 
of unity subordinate to our regular covering. For each i we can interpret fih; 
as a C” function on M by taking it to be 0 outside supp f;, which lies in 
o7 1(B4(0)). Thus h = ¥° fih; is defined and C® on all of M. Moreover, since 


5 f=}, 
lg- hl =I} fig —> fhil < VIAG — hd 


Now if p ¢ g; +(B3(0)), then f(p)(g(p) — h(p)) = 0 and if p € o7 '(B3(0)), then 
\g(p) — h(p)| < (p) by our choice of h;. Since >, fp) over the finite set of i for 
which p € o; '(B%(0)) is 1 and f; > 0 everywhere, we see that |g(p) — h(p)| < 
e(p) for all pe M. | 


(4.10) Remark Techniques of this type are very important in bridging the 
gap between the applications to manifolds of topology—where the data are 
usually continuous—and of calculus concepts such as rank of a mapping. By 
using the fact that a manifold is o-compact, for example, and reverting to the 
use of local coordinates, it is possible to prove by methods of this section 
such statements as the following (compare Steenrod [1, p. 251): 


Let X be a vector field which is continuous on all of M and C* on the closed 
subset A. Then X may be approximated arbitrarily closely by a C* -vector field 
Y on M such that X = Y on A. 


Exercises 


1. Iffis a C” function on a closed regular submanifold N ofa manifold M, 
then show that f is the restriction of a C% function on M. 

2. Show that if N is a closed regular submanifold of M, then a C% -vector 
field X on N can be extended to a C*-vector field on M.[Hint: Takea 
covering of N by preferred coordinate neighborhoods of M and use a 
partition of unity subordinate to this covering and to the open set 
M-— N: X can be extended easily within a preferred coordinate 
neighborhood. ] 


PDF pdfFactory Pro www.fineprint.cn 


5 TENSOR FIELDS 199 


3. Show that on a Riemannian manifold every point p lies in an open set 
U, over which we may define a C* field of frames which is orthonormal 
at each point. 

4. Let M be a manifold of dimension k and F: M > R" a C” imbedding of 
M in R". Further, let G(n, k) be the Grassman manifold of k-planes 
through the origin of R”. Show that the map H: M — G(n, k), obtained 
by mapping p to the k-plane through the origin parallel to F, (7,(M)), is 
C*. [This generalizes the Gauss mapping for surfaces in R*.] 

5. Show that if Fo and F, are disjoint closed subsets of a C* manifold M, 
then there exists a C~ function fon M that is 0 on Fo and +1 on F, 
(compare Theorem III.3.4). 

6. Prove Theorem 4.8 under the weaker assumption that A is a closed 


subset of M. 

7. Extend Theorem 4.8 to a mapping into R”, arbitrary N, or into another 
manifold. 

8. Prove that a continuous vector field can be approximated by a C® vector 
field. 


9. Prove that on every C” manifold M it is possible to define a proper C” 
function f: M —> R, i.e., such that the inverse image of a compact set is 
compact. 


5 Tensor Fields 


Tensors on a Vector Space 


It is our purpose in this section to define and study some properties of 
tensor fields on a manifold, especially covariant tensor fields. As in the case 
of covectors and bilinear forms, which are examples of such tensors, we 
begin with a vector space V over a field, in fact over R. 


(5.1) Definition A tensor ® on V is by definition a multilinear map 
P: Vx x Vx V*x-->x VOR 
— a 

V* denoting the dual space to V. r its covariant order, and s its contravariant 
order. (Assume r > 0 or s > 0.) 

Thus ® assigns to each r-tuple of elements of V and s-tuple of elements of 
V* a real number and if for each k, 1 < k < r + s, we hold every variable 
except the kth fixed, then ® satisfies the linearity condition 


, 


D(a, ---, GV, + OV, o) = ADV), 02, Wo e) H DCV, s VES) 


for all x, x'e R, and v,, vpe V (or V*, respectively). (This equation defines 
precisely the meaning of multilinearity.) As examples we have: (i) for r = 1, 
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s = 0, any ge F*, (ii) for r = 2, s = 0, any bilinear form ® on V, and finally 
(iii) the natural pairing of V and V*, that is, (v, o) > <ọ, v> for the case 
r = 1,5 = 1. We have also noted that V and (V*)* are naturally isomorphic 
and thus may be identified so then each ve V may be considered as a linear 
map of V* to R, that is, as a tensor with r = 0 and s = 1. 

For a fixed (r, s) we let 7%(V) be the collection of all tensors on V of 
covariant order r and contravariant order s. We know that as functions from 
V x x Vx V* x> x V* to R they may be added and multiplied by 
scalars (elements of R). (Indeed linear combinations of functions from any 
set to R are defined and are again functions from that set to R, a circum- 
stance that we have used on several occasions.) With this addition and scalar 
multiplication 7%(V) is a vector space, so that if ®,,®,¢7{(V) and 
a, %2 E€ R, then a, P, + x, ,, defined in the way alluded to above, that is, 
by 

(aD, + &2D5)(V1, V2 ,---) = 1D, (Vy, V2,--.) + 2 B2(¥1, V2, ---), 


is multilinear, and therefore is in 7{(V). Thus 7{(V) has a natural vector 
space structure. In this connection we have the following theorem: 


(5.2) Theorem With the natural definitions of addition and multiplication 
by elements of R the set 7 =(V) of all tensors of order (r, s) on V forms a vector 
space of dimension n +S. 


Proof We consider the case s = 0 only, that is, covariant tensors of 
fixed order r, and we let 7"(V), rather than 7%(V), denote the collection of 
all such tensors. If e,,...,e, is a basis of V, then De .7'"(V) is completely 
determined by its n” values on the basis vectors. Indeed by multilinearity if 
we write v; = } a/e,,i = 1, ..., r, then the value of ® is given by the formula 


(5.2) Dàvi... y) = >, ai'ai ab @(e;,,..., e), 
Ties jr 
the sum being over all 1 < j;,...,j, < n. The n” numbers {®{e;, ..., ej,)} are 
called the components of ® in the basis e,, .... e„. We shall justify the termin- 
ology by showing that there is in fact a basis of 7’(V), determined by 
€,,---.@,, With respect to which these are components of ®. It is defined as 
follows: Let Q/1* be that element of 7’(V) whose values on the basis 
vectors are given by 
a +1 if j,=k, for i=1,...,7, 

OF Fe, ory e) = i 
0 if j,#k; for some i, 
and whose values on an arbitrary r-tuple v,,..., v,¢ V is defined by (5.2’); 
whence 


CP? PVs edie Ve) = ajla} o ah. 
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This does define a tensor: multilinearity is a consequence of this formula, 
which is linear in the components of each v;. It is immediate that the n” 
tensors so chosen are linearly independent: If 


then it follows that 


for any choice of the variables v,,...,v,. But from the definition of the 
Q 4" we see, by substituting in turn each combination e,,,..., €x, of basis 
elements as variables, that every coefficient yx, ... x, = 0. 

However, we also find that every ® is a linear combination of these 
tensors. Let @;,...;,= P(e;,,...,e;,) and consider the element 
È pjp T of Z"(V). Applying again the definition of Qi! ~+, we see 
that this tensor and ® take the same values on every set of basis elements, 


hence must be equal. This completes the proof for 7’(V). i 
We remark that an easy extension of the argument using both e,,..., e, 


and its dual basis w', ..., œ” of V* gives the general case 77(V). Since we use 
covariant tensors in most of what follows, we will leave the more general 
treatment to the exercises and to the imagination of the reader. 


Tensor Fields 


It is easy to extend these ideas to manifolds following the pattern we 
established earlier. 


(5.3) Definition A C®-covariant tensor field of order r on a C® manifold 
M is a function ® which assigns to each pe M an element ®, of 7'(T,(M)) 
and which has the additional property that given any X,,..., X,, C”-vector 
fields on an open subset U of M, then ®(X,,..., X,) is a C” function on U, 
defined by O(X,,...,X,) (P) = ®,(X,,,..., Xp). We denote by 7(M) the 
set of all C*-covariant tensor fields of order r on M. 

We have already considered in some detail the case of covector fields, 
r = 1, and that of fields of bilinear forms, r = 2. Just as in these cases, it is an 
immediate consequence of the definition that a covariant tensor field of 
order r is not only R-linear but also C”(M)-linear in each variable. For 
example, if fe C°(M), 


D(X- SXi e X) = (KX ,, ..., Xj 5-5 X,): 
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This is true because it holds at each point p by the R-linearity of ©,; the two 
sides of the equation are equal since equality holds for each pe M. In the 
same way, if fe C*(U), U open in M, the equation holds for ®y , the restric- 
tion of ® to U. (Compare Remark 1.3 and Exercise 2.2.) 

In precisely the same fashion as Section 2 we see that if U, p is a coordin- 


ate neighborhood and Ej, ..., E, are the coordinate frames, then De 7 "(M) 
has components ®(E;,,...,E;,), C” functions on U whose values at each 


pe U are the components of ®, relative to the basis of T,(M) determined by 
E,,..-, Ep. Once more, just as before, the differentiability of ® is implied by 
the differentiability of all the components as functions on the coordinate 
neighborhoods of some covering of M. Finally, it is easy to see that 7”(M) is 
a vector space over R [in fact it is a C°(M) module] since linear combina- 
tions of covariant tensors of order r (even with C® functions as coefficients) 
are again covariant tensor fields. 


Mappings and Covariant Tensors 


A further basic fact which carries over to arbitrary r > 0 from covectors 
and forms is that any linear map of vector spaces F: V > W induces a 
linear map F*: 7"(W) > 7'(V) by the formula 

F*O(v,,..., v,) = O(F, (v1), ---, Fy(v,))- 


In exact analogy with the case r = 2, we find that a C”-map F: M > N 

induces a mapping F*: 7"(N) > 7"(M), defined for ® on N by 
(F*®),(X aps - 00> Xr) = Oren F (Xp) +++ F(X ,p)). 

As we have seen, this is a special feature of covariant tensor fields; its analog 

does not hold for contravariant fields even for 7 ,(M) = ¥(M) (vector 

fields); see Definition IV.2.6. Not only does F* map 7'(N) to T'(M) but it 


maps it linearly; this is an immediate consequence of the definitions (com- 
pare Exercise V.2.6). 


(5.4) Definition We shall say that De 7"(V), V a vector space, is sym- 
metric if for each 1 < i, j < r, we have 

(Yi s Vig ees Vj creo Ve) = DV, ey Vis eee Yio eres v,). 
Similarly, if interchanging the ith and jth variables, 1 < i, j < r, changes the 
sign, 
a y) = — OP jg cag Vpn ices Vises He) 


MV (2025 Vises Yo e 


then we say ® is skew or antisymmetric or alternating ; alternating covariant 
tensors are often called exterior forms. A tensor field is symmetric (respec- 
tively, alternating) if it has this property at each point. 
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The following generalization of Theorems 1.6 and 2.3 summarizes these 
remarks: 


(5.5) Theorem Let F: M > N be a C® map of C® manifolds. Then each 
C®-covariant tensor field ® on N determines a C®”-covariant tensor field F*® 
on M by the formula 


(5.5') (F*D) (Xt ps o Xep) = Dep (F(X ip) o F(X ep). 


The map F*: 7 "(N) > F'"(M) so defined is linear and takes symmetric (alter- 
nating) tensors to symmetric (alternating) tensors. 


We leave the proof as an exercise. Note that (5.5’) is the same as 
formula (2.2). [Warning: under composition we have (F o G)* = G* o F*.] 

It is also clear how to extend to the case of arbitrary order r the formula 
(2.4) for components of F*® in terms of those of ® and the Jacobian of F in 
local coordinates. The same method can also be used to derive formulas for 
change of components relative to a change of local coordinates (for r = 1 
see Corollary 1.7). Basically, these formulas are all consequences of the mul- 
tilinearity at each point of M. 


The Symmetrizing and Alternating Transformations 


In order to pursue some of these questions somewhat further, we return 
to the case of a covariant tensor on a vector space V. First note that if ®, and 
®,¢.7'"(V) are symmetric (respectively, alternating) covariant tensors of 
order r on V, then a linear combination «@, + B®,, a, BE R, is also sym- 
metric (respectively, alternating). Thus the symmetric tensors in 7"(V) form 
a subspace which we denote by ’(V) and the alternating tensors (exterior 
forms) also form a subspace /\’(V). These subspaces have only the 0-tensor 


in common. 
Next let o denote a permutation of (1,...,r) with (1,...,r)—> 
(o(1), ..., o(r)). We know that any such permutation is a product of permu- 


tations interchanging just two elements (transpositions). Although this 
representation is not unique the parity (evenness or oddness of the number 
of factors) is. We let sgn o = +1 if o is representable as the product of an 
even number of transpositions and sgn o = — 1 otherwise. Then, o > sgn c 
is a well-defined map from the group of permutations of r letters S, to the 
multiplicative group of two elements + 1. It is even a homomorphism as we 
check at once from the definition. The only statement not obvious is the one 
concerning the independence of the parity of the particular decomposition of 
d into a product of transpositions (for a proof see Zassenhaus [1]). 

In the light of these facts we see that our original definitions may be 
restated in the following equivalent form: De.7"(V) is symmetric if 
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D(Voc1), +++» Yorn) = P(v,,.-.,¥,) for every v,,...,¥, and permutation o, and 
is alternating if P(v,(1),-.-5 Yoq) = SEN oP(V,,...,V,) for every v,,...,v, and 
permutation ø. 


(5.6) Definition We define two linear transformations on the vector space 
TV), 


symmetrizing mapping =F: T'(V)> F'(V), 
alternating mapping A: TV) F'(V), 


by the formulas: 


1 
(FOY... Y) = a X @(voc1) 5 «+» Yow) 


and 
1 
(S®)(v,,...,¥,) = T > sgn GD(Va(1), «+ +5 Yoh 


the summation being over all o e S,, the group of all permutations of r 
letters. 


It is immediate that these maps are linear transformations on 7'"(V), in 
fact © > °, defined by 


©*(v,, oe v,) = O(V,(1) perea Yan) 


is such a linear transformation; and any linear combination of linear trans- 
formations of a vector space is again a linear transformation. We have the 
following properties. 


(5.7) Properties of £ and S: 


(i) £ and ¥ are projections, that is, £? = £ and P? = F; 

(i) A(T) = NUV) and S(7'0) = EV); 

(iii) © is alternating if and only if 4® = ®; 
® is symmetric if and only if SỌ = È; 

(iv) if Fy: V—W is a linear map, then £ and S commute with 
F*: J'(W) > F'(V). 


All of these statements are easy consequences of the definitions. We shall 
check them only for £, the verification for Y being similar. They are also 
interrelated so we will not take them in order. First note that if ® is alter- 
nating, then the definition and (sgn o)? = 1 imply 


O(v,, ---, V,) = sgn GD(V,,1), ---5 Yom) 
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Since there are r! elements of S,, summing both sides over all o € G, gives 
® = AD. On the other hand if we apply a permutation < to the variables of 
(v;,...,¥,) for an arbitrary De .7'(V), we obtain 


1 
ADV) s Veer) = = D sgn 6O(Ve.1) > -s Vore) 
= G 


Now sgn is a homomorphism and sgn 7? = 1 so that sgn o = sgn ot sgn t. 
From this equation and since ot runs through ©, as o does, we see that the 
right side is 


1 
sent 2 Sgn otỌ(Yna) > -s Yoro) = SEN TA D(V,,..., Yr) 


and ./® is alternating. This shows that 4 (7"(V)) = A'(V). If ® is alter- 
nating, every term in the summation defining .® is equal, so 4P = P. Thus 
s is the identity on A'(V) and #(7"(V)) > /\’(V). From these facts (i}(iii) 
all follow for £. Statement (iv) is immediate from the definition of F*, for we 
have 


F*®D(v,1) 5 +++) You) = O(F (Voy) «++ F (Vo) 


Multiplying both sides by sgn ø and summing over all ø gives—if we use the 
linearity of F*—.o(F*®)(v,, ..., v,) on the left and F*(/®)(v,,..., v,) on the 
right. 

Both of these maps s% and Z can be immediately extended to mappings 
of tensor fields on manifolds—with the same properties—by merely apply- 
ing them at each point and then verifying that both sides of cach relation 
(i}(iv) give C® functions which agree pointwise on every r-tuple of 
C®-vector fields. We summarize (without proof): 


(5.8) Theorem The maps of and S are defined on 7"(M) (M a C” mani- 
fold and 7'(M) the C®-covariant tensor fields of order r) and they satisfy 
properties (5.7), (i}(iv), there. In the case of (iv), F*: 7"(N) > F"(M) is the 
linear map induced by a C” mapping F:M > N. 


Exercises 


1. Assume that dim V > 1 and that when r = 2 we have 7'(V)= NV) © 
¥"(V) but’ that this is in general false if r > 2. 

2. Show that /\’(V) contains only the tensor 0 when r > dim V. 

3. Let &(V) denote the space of all linear transformations on V. Show that 
it is a vector space over R of dimension equal to (dim V} and that it is 
naturally isomorphic to the space of all bilinear maps of V x V* to R. 
[Hint: To a linear transformation A: V> V we associate the bi- 
linear map (v, g) > <Av, o> on V x V* to R] 
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4. Give a definition of a C% field of linear transformations on M and check 
that its property of being C® can be defined in terms of local coordinates 
or in terms of ¥(M). 

5. If V is a vector space with an inner product, then there is a natural 
isomorphism of V to V* (compare (iii) of Section 2). Show that this 
determines an isomorphism of 7*(V) and .7"**(V) and extend to tensor 
fields on C” manifolds. 

6. If@is a C”-covariant tensor field of order r on a C” manifold M, show 
that @(X,,...,X,) is a C*(M)-linear function from ¥(M) x- x 
X(M) to C”(M). Conversely, show that each such function determines 
an element of 7’(M) as we have defined it. 

7. Let T:V x+ x V> V be an R-linear function ofr vectors with values 
in V, that is, T(v,,...,¥,) is in V and linear in each variable. Define 
components in this case and extend this object to a field on a manifold in 
the manner of Exercises 3 and 4. 

8. As in the case of vector fields on manifolds, a tensor field, say 
be 7'(M), is a function assigning to each pe M a covariant tensor ®, 
on T,(M), that is, a function from M to the set W = em T"(T,(M)). 
Try to define the structure of a C”? manifold on W such that (1) the 
natural mapping x: W > M taking ®, to p for each ®, € 7"(T,(M)) is 
C” and (2) covariant tensor fields on M are exactly the C” mappings 
®: M > W satisfying xo ® = identity (on M). 


6 Multiplication of Tensors 


Except for a few of the exercises, we will continue to restrict our attention 
to covariant tensors in the remainder of this chapter and in the next. Thus V 
will denote a vector space and M a C® manifold, as before. We have seen 
that both 7'(V) and 7"(M) are vector spaces over R. In the case of tensor 
fields, 7'(M) has also the structure of a C*(M)-module. We agree by 
definition that 7 °(V) = Rand 7 °(M) = C*(M). Having made these con- 
ventions, recall that our viewpoint is to define tensors as functions to R, a 
field, in the case of 7 '(V) and functions to C®(M), an algebra, in the case of 
Z*(M). In either case it is appropriate to discuss products of such functions. 
Just as functions from a set to an algebra can be multiplied in a natural 
way—using the algebra product of their values—to give new functions of the 
same type, so can we hope to multiply tensors. As usual we begin with the 
vector space Case. 


Multiplication of Tensors on a Vector Space 


Suppose V is a vector space and ge F"(V), y € 7 *(V) are tensors. Their 
product is easily seen to be linear in each of its r + s variables so we make 
the following definition: 
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(6.1) Definition The product of pọ and w, denoted p ® y is a tensor of 
order r + s defined by 


9 WV iy Dens Vee Ven a eo Was) = OV goes VU tis eeo Vena) 


The right-hand side is the product of the values of p and w. The product 
defines a mapping (9, Y) > Q y of FJ"(V) x FT(V) > T"*(V). 


(6.2) Theorem The product J'™V)x T(V)>T'*(V) just defined is 
bilinear and associative. If w',...,@" is a basis of V* = 7 '(V), then 
fo! ®@ ---@ wi over all 1 < i... i, <n is a basis of Z"(V). Finally, if 
F,: W >V is linear, then F*(p © y) = (F*@) @ (F*y). 


Proof Each statement is proved by straightforward computation. To 
say that @ is bilinear means that if «, B, are numbers g,, p, € 7'"(V) and 
ye T(V), then (xp, + Bi.) © Y = alp: @ Y) + B(p2 © Y). Similarly for 
the second variable. This is checked by evaluating each side onr + s vectors 
of V; in fact basis vectors suffice because of linearity. Associativity, 
(9 © W) ®©6 = 9 @ (W @ 9), is similarly verified—the products on both 
sides being defined in the natural way. This allows us to drop the paren- 
theses. To see that w ® --- @ w" form a basis it is sufficient to note that if 


e;,-.-,@, is the basis of V dual to w',...,@", then the tensor Q'' 
previously defined is exactly w'!@-:-@ œ". This follows from the two 
definitions: 


a 0 if (easi) Aice) 
(e;, i.) \1 if (i, i) = (ji eej) 
and 
w! O: @ olej, -.-, &;,) = 0”? (e;,)o(e;,) = w'2(e;,) = 5 6% OF 
which show that both tensors have the same values on any (ordered) set ofr 


basis vectors and are thus equal. 
Finally, given Fẹ: W > V, if w,,...,W,4,¢ W, then 


(F*(9 @ YW -> Wes) = 9 @ WF alwi), Fa(W,+5)) 
= o(F,(w, ), s." F (w, YF (w+ 1), EERI F (Ww, +s) 
= (F*ọ) & (F*y )(wi, tss Wris) 
which proves F*(ọ © y) = (F*ọ) ® (F*w) and completes the proof. | 
A more sophisticated but no more general way to state this theorem is 
derived from the following observations. First we take the direct sum over R 
of all of the tensor spaces, beginning with 7 °(V) = R. We denote it by 


T(V), 
TFV=FWVWOT'VO: O@7'VE--. 


PDF pdfFactory Pro www.fineprint.cn 


208 V TENSORS AND TENSOR FIELDS ON MANIFULVSD 


We identify each 7'"(V) with its (natural) isomorphic image in 7 (V). An 
element ọ of 7 (V) is said to be of order r if it is in 7"(V), and every element 
@ of 7 (V) is the sum of a finite number of such ø, which we call its 
components. Thus ġe 7 (V) may be written uniquely @ = gi + + ir, 
where gic T'(V) and i, < i <`] < i. If 6, ~e 7 (V), then they may be 
added componentwise, that is, by adding in 7"(V) any terms in 7"(V). They 
may be multiplied by using Q, extending it to be distributive on all of 7 (V). 
This makes 7 (V) into an associative algebra over R called the tensor alge- 
bra. It contains R = 7 °(V), has 1 as its unit, and is infinite-dimensional. The 
contents of Theorem 6.2 (even a little more) can be written: 


(6.2) F(V) = Yo FV) (direct) is an associative algebra (with unit) over 
R = FV). It is generated by F (V) and 7 \(V) = V*, the dual space to V. 
Any linear mapping F,: W > V of vector spaces induces a homomorphism 
F*: J(V) + F(W) which is (i) the identity on R and (ii) the dual mapping 
F*: V* > W* on T '(V). Together (i) and (ii) determine F* uniquely on all of 
T(V). 

Multiplication of Tensor Fields 


Now we turn briefly to the case of tensor fields on a manifold M. If 
ye F"(M) and y e 7*(M), then we may define g ® y on M by defining it at 
each point using the definition for tensors on a vector space, that is, (p @ y), 
is defined to be the tensor gy, @ Y, of order r + s on the vector space T,(M). 
Since this defines a covariant tensor of order r + s on the tangent space at 
each point of M, it will define a tensor field—if it is C”. Now in local 
coordinates the components of g ® y, according to the definition just given, 
are the functions of the coordinate frame vectors 


o @ Y(Ein -1 Erg) = lE e Ei WE ++ Eins) 


over the coordinate neighborhood. The right-hand side is the product of two 
C?” functions, components in local coordinates of g and y, and thus the left 
side is C? as hoped. We have an appropriate version of Theorem 6.2 for this 
case. 


(6.3) Theorem The mapping 7"(M) x 7*(M) > 7'**(M) just defined is 
bilinear and associative. If w', ..., œ" is a basis of 7 *(M), then every element 
of T'(M) is a linear combination with C” coefficients of {w' @ ++: @ œ" 
1<i,,...,i,<n} If F:N—+M is a C” mapping, pe 7'(M) and 
we T*(M), then F*(p Q Y) = (F*o) @ (F*), tensor fields on N. 


Proof Since two tensor fields are equal if and only if they are equal at 
each point, it is only necessary to see that these equations hold at each point, 
which follows at once from the definitions and the preceding Theorem 6.2. 
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In general we do not have a globally defined basis of 7 '(M), that is, 
covector fields œw, ..., œ", which are a basis at each point. However, we do in 
R", from which the following corollary is obtained by applying the theorem 
to a coordinate neighborhood U, @ of M. Let E}, ..., E, denote the coordin- 
ate frames and o',...,@" their duals, that is, E, = 6,1(6/éx') and 
w = 6*(dx’). 


(6.4) Corollary Each ye 7'"(U), including the restriction to U of any cov- 
ariant tensor field on M, has a unique expression of the form 


p — Doo È a, O R0 Bar, 


where at each point of U, a;, ...;, = @(E;,,..., E,,) are the components of ọ in 
the basis {«"! @ +- ® w'} and are C® functions on U. 


Exterior Multiplication of Alternating Tensors 


For each r > 0 we have defined the subspace /\"(V) = 7"(V) consisting 
of alternating covariant tensors of order r; it is the image of 7'(V) under the 
linear mapping s, the alternating mapping. We define /\°(V) to be R, the 
field. Then A\°(V) = 7°(V) = R and NHV) = 7'(V) = V*, but /A\'(V) is 
properly contained in 7'(V) for r > 1 (Exercise 5.2). We see, therefore, that 
the direct sum /\(V) of all the spaces /\"(V) is contained in 7 (V) as a 
subspace: 


AP =A ANa ANANS 
c T(V) T(V) T?(V) a = 7 (V) 


Although A (V) is a subspace of F (V), it is not a subalgebra. For even if 
pe /\'(V) and we A'(V), it may be shown by example (Exercise 1) that 
ọ ® w may very well fail to be an element of /\’**(V); thus the tensor 
product of alternating tensors on V is not, in general, an alternating tensor 
on V. We know, however, that each tensor determines an alternating tensor, 
its image under ./. This fact enables us to define another multiplication for 
alternating tensors that is extraordinarily useful. 


(6.5) Definition The mapping from /\"(V) x AV) > /\'*5(V), defined 
by 


e-+] ooy 


is called the exterior product (or wedge product) of @ and w and is 
denoted pa w. 
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(6.6) Lemma The exterior product is bilinear and associative. 


Proof Bilinearity is a consequence of the fact that the product is defined 
by composing the tensor product, a bilinear mapping from N) x ACY) 
to 7’*5(V) with a linear mapping ((r + s)!/r! s!).#. 

To show that the product is associative we first prove a property of the 
alternating mapping «/. Suppose ge 7"(V), We T(V), and Oe 7 (V). 
Then we show that 


Alo @W @ 0) = A(A (9 Q Y) @ 0) = Ap @ AY @ A). 


For this purpose let G=G,,,,, denote the permutations of 
(1,2,...,7 +s + t) and GS’ the subgroup which leaves the last t integers 
fixed, Denote by ¢ > 0’ the isomorphism of the permutation group ©,,, 
of (1,2,...,r + s) onto S’. We then have 


AD @ y) & OV isee Vrste) 


1 
( È Sgn GP(Ve(1),+- +> V(r) WY otr + 1o verve) 


ES)! Noé 


` OY 45415 FEER Vst) 


1 , 
= (+s)! X sgn o'oa -Wor + 1)9 ++) 
‘aes’ 


‘ WAV ts+ ijsres » Yor +s+) 
Hence, 
AAE @ Y) @ Ois Verse) 


1 1 
~(rtstoi(r+s)! 


. y È sgn TO PVig(1)> +% MY + jaers JO on + s+ 4 .). 


teS a’ eS’ 


Reversing the order of summation, i.e., summing over 7, yields 


1 
(r +s)! » ae @ Y ® OV ips +60 Vrts+r) 
‘oes 


=A IYO ON¥1,.- +9 Ve+s42) 


since the sum on the left contains (r + s)! identical summands. Thus, we 
obtain 


Alo Q Y @I=A(A(o @ Y) Q O) 
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The second equality is proved in the same way. If g, y, @ are in the subspaces 
AN), A), AV), respectively, then by definition 


(r rey y 


paw = Alo Q y) 


and 


(r +s+t)! 


(r + s)!tl A((prw)® 0). 


(pAW)AG = 


From this and a similar expression in the other order of associating terms we 
obtain the associativity of the exterior product 


(PAW)AB = pa (Y ^0). I 


The following relation is an immediate consequence of the proof, which 
allows us to write exterior products without parentheses. 


(6.7) Corollary Let p;e /\"(V), i = 1,..., k. Then 


ritr t'er)! 
Orrgrrng = 4 ETR s) Ap @ POO gy). 
1: 2° ke 


Lemma 6.6 makes it possible for us to give /\(V) the structure of an 
associative algebra over R; we define the product /\(V) x AW) > A) 
simply by extending the exterior product to be bilinear, so that the distribu- 
tive law holds. This is possible in only one way: Suppose that g, y e /\(V). 
Then 


p=pi t tn, GEA), Maw tooth, We AV) 


and we define 


(68) Corollary /\(V) = A\°(V)@ AV) A?) with the exter- 
ior product as defined above is an (associative) algebra over R = /\°(V) 


The algebra /\(V) is called the exterior algebra or Grassman algebra over 
V. Unlike the tensor algebra 7 (V), of which it is a subspace (but not a 
subalgebra), it is finite-dimensional. To see this we determine a basis of 
/\(V) as a vector space. For this we need the following lemma, which is 
important in its own right. 
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(6.9) Lemma If ge /\'(V) and we /\°(V), then 
erw =(-1 Yng. 
Proof This is equivalent to showing that 
Alo @Y) =(— IAY o) 


To prove this equality we note that 


A (p © W)(v1, tees V, +5) 


1 
)! > Sgn FP(Va(1) > -> Vor) W (Yor +1) press Va(r+s)) 
o 


E “ir +s 
= 7 ! >» SEN OW(Voir 41) > +++ Voet) Yoa) sees Yair) 
(r +s)! 4 
If t is the permutation taking (1, ..., r,r + 1,...,7+ s)to(rt+1,...,r+s, 
1,...,7), then we may write, using sgn o = sgn ot sgn T, 


A (o & w)(V1, ESER V+) 


= (r 4 s)! x sgn OT sgn THY aa) e Varis) PWVors + bp ca Vor +s)) 
g 


= sgn tA (Y Q @)(V;,---, Yrs) 


Since it is easily checked that sgn t = (— 1)", this gives the relation ọ ^ y= 


(=I YA g. 
(6.10) Ten If r A dim V, then /\'(V) = {0}. For O<rx< n, 
dim \'(V ). Let w!,..., @" be a basis of \' (V). Then the set 


eee (=i, <i < <i, <n) 
is a basis of /\’(V) and dim A (V) = 2". 


Proof Let e;,..., €, be any basis of V. If ọ is an alternating covariant 
tensor of order r> dim V, then on any set of basis elements 
p(e;,, --., €) = 0. For some variable ei, is repeated and interchanging two 
equal variables both changes the sign of g on the set and leaves it 
unchanged—the same argument one uses to show that a determinant with 
two equal rows is zero. Since all components of ọ are zero, ọ = 0 so 
N) = {0} 

Suppose that 0 < r < n and that @,..., @, is the basis of V* = A (V) 
dual to e,,...,e,. Since .« maps 7”(V) onto A'(V), the image of the basis 
lo @ --- @ wit of Z"(V) spans /\’(V). We have 


r! A (wo R @ a") =a vn Ao". 
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Permuting the order of i,, ..., i, leaves the right side unchanged, except fora 
possible change of sign according to Lemma 6.9. It follows that the set of (7) 
elements of the form œa =- aœ” with 1 <i, <i, <7 <i, <n span 
/\'(¥). On the other hand, they are independent. For if we suppose that 
some linear combination of them is zero, say 


Y a @' naw’ = O, 
1S <i, 


then its value on each set of r basis vectors must be zero. In particular, given 
ki < < k,, we have 


0 = (Da, np OAA wi)(e,,..., €x), 


which becomes a,...,,=0 by virtue of the formula of Corollary 6.7 
combined with the fact that w‘(e,) = ôi for 1 < i,k < n. By suitable choice 
of kı <-+::<k,, we see that each coefficient must be zero; therefore the 
given set of elements of /\"(V) is linearly independent and a basis. 

To complete the proof we note that 


dim /\(V) = > dim \(V) = F (') = 2" I 


r=0 \Y 


The following theorem is an immediate consequence of Theorem 6.2, the 
fact that of o F* = F* o s, and the definition of exterior multiplication. 


(6.11) Theorem Let V and W be finite-dimensional vector spaces and 
F,: W >V a linear mapping. Then F*: F (V}—> 7 (W) takes /\(V) into 
/\(W) and is a homomorphism of these (exterior) algebras. 


The Exterior Algebra on Manifolds 


It is evident from what we have seen above that all of these ideas extend 
to alternating tensor fields on a C” manifold M. We introduce the following 
terminology: 


(6.12) Definition An alternating covariant tensor field of order r on M 
will be called an exterior differential form of degree r (or sometimes simply 
r-form). 


The set /\’(M) of all such forms is a subspace of 7’(M). The following 
two theorems are immediate consequences of what has been done above and 
their proofs will be left to the reader. We let M, N be manifolds and 
F:M—N bea C” mapping. 
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(6.13) Theorem Let /\(M) denote the vector space over R of all exterior 
differential forms. Then for ge/\"(M) and we /\(M)_ the formula 
(PAW)p = PpAW, defines an associative product satisfying PAW = 
(— 1Y a @. With this product, /\(M) is an algebra over R. If fe C®(M), we 
also have (fo)aw = f(prw) = orf). Ifa',..., @" is a field of coframes 
on M (or an open set U of M), then the set {œn Aw 
1 <i, <i, < <i, < m isa basis of N(M) (or /\"(U), respectively). 


(6.14) Theorem If F:M—N is a C” mapping of manifolds, then 
F*: N(N) > (\(M) is an algebra homomorphism. 


We shall call A (M) the algebra of differential forms or exterior algebra 
on M. 


Exercises 


1. By constructing an example show that tensor products are not commu- 
tative and that p, w symmetric (respectively alternating) does not imply 
ọ @ y is symmetric (respectively, alternating). 

2. Let X(V) denote the subspace of 7 (V) consisting of all o € 7 (V) whose 
constituents are symmetric: L(V) is the image of £°(V) ® ='(V) ® 
+ ® L(V) @--- in Z (V) under the natural injection defined by inclu- 
sion of L’(V) in 7"(V). Define an associative multiplication in X(V) by 
analogy with that for /\(V) and prove analogs of the theorems proved 
for /\(V) where possible. 

3. If ge AV) i=1,...,k, show that p^ A OV a0 W) = 
det(p;(v;)), a k x k determinant. 

4. Show that the C® exterior forms of order r on M are exactly the func- 
tions ®: ¥(M) x- x ¥(M)—> C*(M) which are multilinear, in the 
sense of C°(M) modules, and alternating. Find an example of such a 
function which is R-linear but not a tensor field. (Hint: Use [X, Y].) 

5. Let Q,,.-.,@, be elements of V* = /\'(V). Show that they are linearly 
dependent if and only if 9, ^ ^P, = 0. 

6. Assume ge /\'(V) and ve V. Define an element i(v)@ of ANNIV) by 


(i(v)g)(¥1, -> Ve-1) = Q(Y, Yis e- Vp—a)s 


Show that i(v) thus defined determines a linear mapping of /\"(¥) into 
ANTEN) and that if ge A\"(V), We AV), then i(v)(p av) = (i(v)p) A 
y + (—1Yọ 1 (i(v)W). Extend this definition and these properties to ex- 
terior forms on a manifold (with v replaced by a vector field). 

7. A Riemannian metric ® on a manifold is often denoted by ds? in local 
coordinates x!,..., x" on M with ds? = )? j=; g(x) dx! dx’. Interpret 
this by the use of tensor multiplication and Theorem 6.3. Show that 
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from this expression for ® and the formulas dx' = )"_, (éx'/dy’) dy’ we 
may derive the formula for change of components of ® relative to a 
change of local coordinates. 


7 Orientation of Manifolds and the Volume Element 


We shall make one application of differential forms in this paragraph. 
others in subsequent chapters. To do this we shall need the concept of an 
oriented vector space. Let V be a vector space and {e,,..., €n} ffi, ..., f,} be 
bases. The bases are said to have the same orientation if the determinant of 
the matrix of coefficients expressing one basis in terms of the other is posi- 
tive, that is, if det(x/) > 0, where f; = Yri ajej, i= 1,..., n. The reader 
should check that this is an equivalence relation on the set of all bases (or 
frames) of V and that there are exactly two equivalence classes. A choice of 
one of these is said to orient V so that we have the following definition: 


(7.1) Definition An oriented vector space is a vector space plus an equi- 
valence class of allowable bases: all those bases with the same orientation as 
a chosen one; they will be called oriented or positively oriented bases or 
frames. 


This concept is related to the choice of a basis Q of /\"(V). Recall that 
dim /\"(V) = (7) = 1, so that any nonzero element is a basis. The relation- 
ship to orientation appears as a corollary to the following lemma: 


(7.2) Lemma Let Q + 0 be an alternating covariant tensor on V of order 
n= dim V and let e,,...,e, be a basis of V. Then for any set of vectors 


Vi,--.,¥, with v; = } ajej, we have 


Q,,...,¥,) = det(a/)Q(e,,...,¢,). 


Proof This lemma says that up to a nonvanishing scalar multiple Q is 
the determinant of the components of its variables. In particular, if V = V” is 
the space of n-tuples and e,,..., e, is the canonical basis, then Q(v,,..., v,) 
is proportional to the determinant whose rows are v,...., v,- The proof is a 
consequence of the definition of determinant. Given Q and v}, ..., Yp, we use 
the linearity and antisymmetry of Q to write 


Q(v,,---5 Wn) = . È aji ob OQle,,, «<5, e) 
Since Q(e,,,..., 


Q0,,...,%) = Yo sgn cal... 0O(e,,...,e,) = det(«/)O(e;,...,e,). 


ceSn 


e,,) = 0 if two indices are equal, we may write 


The last equality is the standard definition of determinant (S, is the symme- 
tric group on n letters). | 
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(7.3) Corollary A nonvanishing Qe /\"(V) has the same sign (or opposite 
sign) on two bases if they have the same (respectively, opposite) orientation; 
thus choice ofan Q + 0 determines an orientation of V. Two such forms Q,, Q3 
determine the same orientation if and only if Q, = AQ, , where A is a positive 
real number. 


Proof From the formula of the lemma we see that Q has the same sign 
on equivalent bases and opposite sign on inequivalent bases. If 4 > 0, then 
AQ. has the same sign on any basis as Q does, whereas the contrary holds if 
A <0. I 


(7.4) Remark Note that if Q # 0, then v,,..., v, are linearly independent 
if and only if Q(v,, ..., v,) # 0. Also note that the formula of the lemma can 
be construed as a formula for change of component of Q2—there is just one 
component since dim /\"(V) = 1—when we change from the basis e;, ..., €n 
of V to the basis v,,..., v,. These statements are immediate consequences of 
the formula in the lemma. 


If V is a Euclidean vector space, that is, has a positive definite inner 
product ®(v, w), then in orienting V we may choose an orthonormal basis 
e,,...,, to determine the orientation and choose an n-form Q whose value 
on e;,...,e, is +1. If f; = } alej is another orthonormal basis, then 


Q(f,, ivan fn) = det(x/)Q(e,, sects e,) 4 1, 


depending on whether f,, ..., f, is similarly or oppositely oriented. [We have 
used the fact that the determinant of an orthogonal matrix is + 1.] Thus the 
value of Q on any orthonormal basis is + 1 and Q is uniquely determined up 
to its sign by this property. In this case the form Q may be given a geometric 
meaning when n = 2 or 3; Q(v,, v2) or Q(v,, V2, Y3) is the area or volume, 
respectively, of the parallelogram or parallelepiped of which the given vec- 
tors are the sides from the origin. This is a standard formula from analytical 
geometry and serves as a geometric motivation for some later applications. 
(See Exercise 2.) 

To extend the concept of orientation to a manifold M one must try to 
orient each of the tangent spaces T,(M) in such a way that orientation of 
nearby tangent spaces agree. We will do this in two ways and then demon- 
strate their equivalence as an application of the ideas of this chapter. 


(7.5) Definition We shall say that M is orientable if it is possible to define 
a C°” n-form Q on M which is not zero at any point—in which case M is said 
to be oriented by the choice of Q. 


By virtue of Corollary 7.3 any such Q orients each tangent space. Of 
course any form 9’ = åQ, where 4 > 0 is a C” function, would give M the 
same orientation. 
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Thus R” with the form Q = dx! a ++- a dx" is an example; this is known as 
the natural orientation of R" and corresponds to the orientation of the frames 
6/éx',...,6/6x". If U c R" is an open set, it is oriented by Qy = Q|, and we 
say that a diffeomorphism F: U > V c R" is orientation preserving if 
F*Q, = 24Qy, A > Oa C” function on U. More generally a diffeomorphism 
F: M, > M, of manifolds oriented by Q,,,, respectively, is orientation- 
preserving if F*Q, = 2Q,, where å > 0 is a C® function on M. 

A second, perhaps more natural definition of orientability could be given 
as follows: M is orientable if it can be covered with coherently oriented 
coordinate neighborhoods {U,,@,}, that is, neighborhoods such that if 
U, ^n U, + Ø, then y, ° p; is orientation-preserving. We shall now see 
that this second definition is equivalent to Definition 7.5. 


(7.6) Theorem 4A manifold M is orientable if and only if it has a covering 
{U,, Pa} of coherently oriented coordinate neighborhoods. 


Proof First suppose that M is orientable and let Q be a nowhere vanish- 
ing n-form that determines the orientation. We choose any covering {U,, g,} 
by coordinate neighborhoods, with local coordinates x}, ..., x" such that for 
Q restricted to U, we have the expression in local coordinates 


@, Oy, = Ax) dx} A Ad with 4, > 0. 


We may easily choose coordinates so that the scalar function 4, , component 
of Q, is positive on U,, since replacing coordinates (x',..., x") by 
(—x',..., x"), that is, changing the sign of one coordinate, changes the sign 


of 4. An easy computation, using Lemma 7.2 and Remark 7.4, shows that if 
U, ^ Ug # Ø, then on this set the formula for change of component is 


, A OX, 
(16) h det fal Sip 
Since 4, > 0 and A, > 0, the determinant of the Jacobian is positive, so the 
coordinate neighborhoods we have chosen are coherently oriented. 

Now suppose that M has a covering by coherently oriented coordinate 
neighborhoods {U,, g,}. We use a subordinate partition of unity {f} to 
construct an n-form Q on M which does not vanish at any point. For each 
i = 1, 2,... we choose a coordinate neighborhood U,,, g,, of the covering 
such that U,, > supp f;; these neighborhoods, which we relabel U;, p;, cover 
M. If U;n U; # Ø, then by assumption the determinant of the Jacobian 
matrix of g; ° p; ' is positive on U; n U}. Define Qe /\"(M) by 


Q =} _fiož(dx} aa dxf), 


extending each summand to all of M by defining it to be zero outside the 
closed set supp f;. Letting pe M be arbitrary, we will show Q, + 0. By the 
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local finiteness of {supp f;} we may choose a coordinate neighborhood V, y 
of p which is coherently oriented to the U;, p; and intersects only a finite 
number of the sets supp f;, say for i = i,,..., ip. Let y’,..., y" be the local 
coordinates in V and use formula (7.6’), on each summand to change 
components: 


k 
= $ fips (dxi,a- A dxi) = $ Silo) ) de“) y*(dy! av Ady"). 
J=1 W(p) 


Now each f;, > 0 on M and at least one of them is positive at p; moreover, 
the Jacobian determinants are all positive. This implies Q, # 0 and since p 
was arbitrary, Q is never zero on M. I 


A Riemannian manifold has the special property that the tangent space 
T,(M) at every point p has an inner product. Applying our remarks about 
n-forms on a Euclidean vector space of dimension n, we have the following 
theorem: 


(7.7) Theorem Let M be an orientable Riemannian manifold with Rieman- 
nian metric ®. Corresponding to an orientation of M there is a uniquely 
determined n-form Q which gives the orientation and which has the value + 1 on 
every oriented orthonormal frame. 


Proof It is clear from our earlier discussion that at each point pe M, Q, 
is determined uniquely by the requirement that on any oriented orthonor- 
mal basis F,,, .-., Fn» of T,(M) we have Q (Fip Fap) = +1. Let U, o be 
any coordinate neighborhood with coordinate frames E}, ..., E,,. The func- 
tions g;(p) = ®,(E;,. Ej), peU, define the components of ® relative to 
these local coordinates and are C® by definition. We shall derive an expres- 
sion for the component Q(E,, ..., E,) on U in terms of the matrix (g;;), from 
which it will be apparent that Q is a C* n-form. Choose at pe U any 
oriented, orthonormal basis F,,...,F,, and let the n x n matrix (at) 
denote the components of E,,,.-., Enp with respect to this basis: 


= $ yas be dicey tt. 
k=1 
Then since ®(F,,,, Fip) = da, we have 
gp) = © (Eip, E jp) = 05 ot Fpp x aF) = > olf cel 
k I k=1 


for 1 < i,j < n. This may be written as a matrix equation: 


(9:;(P)) = ‘AA, 
the product of the transpose of A = (af) with 4 itself. 
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On the other hand Q,(E,,,--., Enp) = det(af)Q,(Fip,.--,Frp) by 
Lemma 7.2, and Q(F ,,,..-, Fap) = +1 by our definitions. Since det('4 A) = 
(det A)? = det(g;,), this gives for the component of Q in local coordinates 


Q,(E1p +--+ Enp) = (det(gis(P)))"”, 


which tells us that the component, being the square root of a positive C” 
function of pe U, is itself a C” function on the local coordinate neighbor- 
hood U. Since U, ọ is arbitrary, Q is a C* n-form on M. | 


This form Q is called the (natural) volume element of the oriented 
Riemannian manifold. We have just seen that in local coordinates we have 
the following expression for Q: 


(7.8) go '*Q = g dx! a a dx", 


where g(x) = det(g;;(x)). [We use the same notation for g;; as functions on U 
and on ọ(U).] When M = R” with the usual coordinates and metric, this 
becomes Q = dx! a++- a dx”. In this case, as we remarked earlier, the value 
of Q, on a set of vectors is the volume of the parallelepiped whose edges from 
p are these vectors. In particular, on the unit cube with vertex at p and sides 
é/éx', ..., @/x", Q has the value + 1. As might be anticipated, the existence 
of the form Q on a Riemannian manifold will enable us to define the volume 
of suitable subsets of the manifold and to extend to these manifolds the 
volume integrals defined in R” in integral calculus. 


Exercises 


1. Using the definition of dx’ a++: a dx” on R” from Corollary 6.7, show 
that its value on ¢/éx',..., €/Ex" is indeed +1 so that this is Q for R” 
with the standard Riemannian metric, as claimed above. 

2. Prove that the volume of the parallelepiped of R? whose vertex is at the 
origin and whose sides (from this vertex) are the vectors 
v; = (x}, x7, x2), i = 1, 2, 3, is in fact the determinant of the matrix (x/). 

3. Show that n x n determinants as functions of the n-rows v,,..., V, are 
completely characterized by being alternating n-tensors on V” whose 
value on the standard basis is + 1. 

4. Compute the expression for Q on $° (with the induced metric of R*) in 
terms of the coordinates given by: (i) stereographic projection and (ii) 
spherical coordinates (p, 6, œ) with p = 1. 


8 Exterior Differentiation 


Much of this chapter has been devoted to extending the concepts of 
covariant tensors, and of operations on covariant tensors, on a single vector 
space V to tensors and tensor operations on manifolds. This was done 
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according to a very standardized procedure which consisted in viewing each 
tangent space T,(M) as a copy of V and thus extending the tensor or tensor 
operation point-by-point, making suitable restrictions to ensure some sort of 
smooth variation. By using very different ideas we now introduce an impor- 
tant operator d mapping /\(M) onto itself. It is defined in terms of differen- 
tiation and is known as the exterior derivative; it has no analog on /\( V), the 
exterior algebra of a single vector space. 

When U is an open subset of M we shall denote by 6, the restriction of 
an exterior form on M to U; of course Oy = i*0, i: U > M being the inclu- 
sion map. When U, g is a coordinate neighborhood with x’, ..., x" as coor- 
dinate functions on U, that is, p(q) = (x'(q), ---, x"(q)), then the differentials 
of these functions dx!, ..., dx" are linearly independent elements of /\'(U) 
and constitute a C® field of coframes on U. It follows that they, with 1, 
generate /\(U) over C*(U), or equivalently, C?(U) = /\°(U) and /\*(U) 
generate the algebra /\(U) over R. Thus locally every k-form @ on M has a 
unique representation on U of the form 


b= Y Guage dxa ads di -i€ C”(U), 
i<- <i 
the summation being over all sets of indices such that 1 < i <i, <Ñ 
< i, < n. If we define b;,..; for all values of the indices so as to change sign 
whenever two indices are permuted—in particular to be zero if two indices 
are equal—and to equal a,,...;, if i, <- < i,, then we also have a unique 
representation 


1 A f 
by = È gy Pi ag AXA a dx", 


the summation being over all values of the indices. Both representations are 
used in practice. We are using dx',...,dx" to denote the coordinate 


coframes, rather than œ!, ..., œ” as in Section 1, in order to emphasize that 
the dx’ are differentials of functions on U c M. This is important in what 
follows. 


(8.1) Theorem Let M be any C® manifold and let /\(M) be the algebra of 
exterior differential forms on M. Then there exists a unique R-linear map 
dy: /\(M) > /\(M) such that: 

(1) iffe \°(M) = C*(M), then dy f = df, the differential of f; 

(2) if Oe /\"(M) and ce/\(M), then dy(@rc) = dy^ + 

(—170 ^ dyo; 

(3) dj, =0. 
This map will commute with restriction to open sets U c M, i.e., (dy®)u = 
dyOy, and map /\"(M) into N+M). 
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Proof We give the proof in a series of steps. 


(A) We remark that if dy exists and g, f',..., f” € C” (M), then (1)~(3) 
imply that for 8 = g df! a++- ^ df" we must have 


dyb = dgadf! a adf". 
Now suppose that M is covered by a single coordinate neighborhood 
U, » with coordinate functions xt, ..., x”. The above remark and linearity 
imply that dy must be given by the formula 
(x) dyl} a; ony A An) = Fda, p AO A a dx*, 
where 


"Qj, i 
da, ...; = ut dx’ 
ai, i, A A x 


and the summation in (*) is over 1 <i, <i, < <i, < n. Therefore, if 
defined at all, dy is unique in this case. 

Conversely, the dy defined by («) is linear and trivially satisfies (1) and 
(3). To check (2) it is enough to consider forms 6 = a dx" a++- a dx” and 
o = b dx” a++- adx", the general statement being then a consequence of 
linearity. We have 


dy[(a dx" a +++ A dx'") a (b dx?! a -+ a dx*:)} 
= dy(ab)(dx" a+: a dx'") a (dxi! a +- a dx*) 
= [(dya)b + a(dyb)] A (dx A+++ A dx") a (dx?! A+++ a dx”) 
= (dyan dx" a:a dx”) a (b dx?! a+ a dx”) 
+(—1y(adx" A+++ a dx) a (db adx’! a ++ a dx*), 
which completes the proof. The (— 1} is due to the fact that 
db a dx" a++ a dx* = (— 1Y dx” n+ a dx* a db. 


(B) Now suppose that dy: /\(M) > /\(M) with properties (1)-(3) is 
defined and that U c M is a coordinate neighborhood on M with coordin- 
ate functions x',..., x". According to (A), dy: (U) > A (U) is uniquely 
defined by (+). We will show that for any 0 e /\(M) the restriction of dy 6 to 
U is equal to dy applied to 0 restricted to U: 


(du )u = dy Oy . 
We may suppose that 0 € /\'(M) and that 
Oy = Daj, p Ax acia dx", a, € C®(U). 
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Suppose p is an arbitrary point of U. Applying Corollary HI.3.5 to an open 
set W, pe W and We U, we may find a neighborhood V of p with V = W 
and C” functions y',..., y” and 6;,...,, on M vanishing outside W but are 


identical to x',..., x" and a;, ...;,, respectively, on V. Define o € ANM) by 
o= F b, np dy atta dy”. 


Then ø is an r-form on M which vanishes outside W and is identical to 0 on 
V. Now let g be a C7 function on M which has the value + 1 at p and is zero 
outside V. The r-form g(0 — c) vanishes everywhere on M as does 
dg ^ (8 — a). Therefore, using (A), 


g dyb =g dys = g >. da; pn dy andy". 


On V we have 
F db; pandy att ady" = } da; ... „adx AS a dx" 


so that at the point p, where g(p) = 1, du 0 = dy Oy . Since p is arbitrary, this 
holds throughout U. 

(C) If dy: (M) > A (M) satisfying (1)-(3) exists, it is unique. Indeed, 
let {U,, p,} be any covering of M by coordinate neighborhoods; each dy, 
exists by (A); and for any 0 e /\(M) we have (dy 9)u, = du,9u, for any U, by 
(B). Since every pe M lies in a neighborhood U,, this would determine dy, 
completely. 

On the other hand, we may use this formula to define dy . To do so we 
must verify that if pe U, ^ Ug, then dy @ is uniquely determined at p. This 
essentially repeats the argument above: Let U = U, ^ Ug; applying (A) and 
(B) to U, an open subset and coordinate neighborhood with coordinate map 
p cut down to U, we have 


(dv, 9u,)u = dy By = (du,9u,)u è 


Therefore (dy O)y, is determined on every U, in such a manner that 
(du 0)uv, = (dm 9)u, ON points common to U, and U,. This determines dy. 

Because (1)-(3) hold on each U, and the other operations of exterior 
algebra commute with restriction: (0 + a)y = Oy + oy, (0 A a)y = Oy A 
Gy, etc., dy has the required properties as an operator on /\(M). This 


completes the proof. 


Since dy is uniquely defined for every C® manifold M, we can drop the 
subscript M and use d to denote all of these operators. We know from the 
above proof that d commutes with restriction of differential forms to coor- 
dinate neighborhoods. It is important to know how it behaves relative to a 
C® mapping F: M > N. Any such mapping, as we know, induces a homo- 
morphism F*: A(N) > /\(M). The following theorem gives the relation of 
F* and d. 
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(8.2) Theorem F* and d commute, that is F*-d = d= F*. 


Proof We know that both F* and d are R-linear and that the equality 
F*(dg) = d(F*@) holds on M if it holds locally. More precisely, by virtue of 
the facts concerning d determined above we see that the theorem will hold if 
we can establish it for any pairs V, y, U, 0 of coordinate neighborhoods on 
M, N, respectively, such that F(V) = U. Let m = dim M and n = dim N 
and x!,..., xX" and y',..., y" be the coordinate functions on V, U, respec- 
tively, with y} = w(x',..., x”), j = 1, ..., n, giving the map F in local coor- 
dinates. Then it is enough to establish F* o d = d o F* on forms of the 
following type: 


o = aly) dy" a a dy", 
any other forms being the sum of such. We proceed by induction on the 
degree of the forms. For forms a(y) of degree zero, that is, C* functions, we 
have 
F*(da) = d(F*a), 
since 
F*(da)(X ,) = da(F,,X,) = (F,X,)a = X,(a° F) = X ,(F*a) = d(F*a)(X,). 


(Note: By definition, F*a = ao F.) 

Suppose the theorem to be true for all forms of degree less than k and let 
ọ be a k-form of the type above. Let 9, = a dy''and g, = dy’? a+: a dy™ so 
that @ = p; A @2 with both p, and o, of degree less than k; moreover since 
d? = 0, we have dg, = 0. Thus 


A(F*(p, A Q2)) = A[(F*Q1) ^ (F*@2)) 
= (dF*p,)A (F*Q2) — (F*Q,) A (dF*@2) 
= F*(dp,)A F*p, = F*(dg, n p2) = F*d(gi A 2). D 


An Application to Frobenius’ Theorem 


The algebra of exterior differential forms /\(M) on a C® manifold M, 
with the operator d just defined, is very important in the application of 
calculus to manifolds. Forms are involved in integration on manifolds 
(especially in extending Gauss’, Stokes’ and Green’s theorems); in the alge- 
braic topology of the manifold via the theorems of de Rham and Hodge; and 
in the study of partial differential equations. We will touch on the first two 
topics later. As to the differential equations aspect, we will show next that 
the essential data and hypothesis of Frobenius’ theorem can be stated in 
terms of d and /\(M). 

On a vector space V of dimension n, a k-dimensional subspace D may be 
determined in either of two equivalent ways: (i) by giving a basis e}, ..., e, of 
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D or (ii) by giving n—k linearly independent elements of V*, say 
g**},..., p" which are zero on D. In fact we may extend e,, ..., € to a basis 
e,,..-,€, of V so that g**!,..., g" is part of a dual basis ot, ..., g" of V*. 

Similarly, if A is a C” distribution of dimension k on M, an n-manifold, 
then locally, say in a coordinate neighborhood V, y, we may suppose A is 
defined by n — k linearly independent 1-forms g**’, ..., g". We may restate 
the condition that A be involutive—hence Frobenius’ theorem—as follows: 


(8.3) Theorem Let A be a C® distribution of dimension k on M, 
dim M = n. Then A is involutive if and only in a neighborhood V of each pe M 
there exist n — k linearly independent one-forms g**', p**?,..., g" which 


vanish on A and satisfy the condition 


dg = ¥ Gag, r=k+1,...,n, 


f=k+1 
for suitable 1-forms @. 


Proof This may be considered a sort of dual statement to our earlier 
condition on A in terms of the existence of a local basis X 4, ..., X, at each 
point. (Just as in that case, we may state the conditions in a fashion which 
does not depend on local bases. This will be done below (8.7), with proof left 
to the exercises.) 

Suppose a distribution A is given. Then in a neighborhood V of each 
point a local basis X,,..., X; oF A can be completed to a field of frames 
Xia Xk Xn. If o!,..., 0, o**1,..., 0" is the uniquely determined 
dual field of coframes, then g**', ..., g” vanish on X,,..., X, and hence on 
A. The distribution A is involutive by Definition IV.8.2 if and only if in the 
expressions [X;, X] = )7-1 ci; X:, giving [X;, X,] as linear combinations 
of the basis, we have ci; = Oforl <i,j<skandk+1<I<n. 


(8.4) Lemma Let we /\'(M) and X, Y e ¥(M). Then we have 
dw(X, Y) = Xo(Y) — Ya(X) — w([X, Y]). 
Let us assume the lemma and proceed with the proof. We compute dg’, 
using the lemma and recalling that g'(X,) is constant for 1 < i,j < n. We 
have then, 


do"(X;.X)) = -oX XD = = Le) = i 


for 1 < i,j,r <n. On the other hand 


PDF pdfFactory Pro www.fineprint.cn 


8 EXTERIOR DIFFERENTIATION : 225 


where bf, are uniquely determined if we assume bi, = —bi,. Hence 
dg’(X;, Xj) =4 pe LX JPX) — o(X)e(X)) 
= 2(bi; — b'i i) = bij . 


From this we have bf, = — c}; and so the system is involutive if and only 
if for each r > k 


n k n 
dg’ = 2 PU gp + X molao! 
1 li=1 j=k+1 


that is, the terms involving bj, with 1 < i,j < kandr > k vanish. Taking the 
terms in { } as 0; we have completed the proof except for Lemma 8.4. Jj 


Proof of Lemma 8.4 It is enough to prove that it is true locally, say ina 
coordinate neighborhood of each point. In any such neighborhood with 
coordinates x!,..., x",@ = )"_, a, dx’ and it is easy to see that the equation 
of the lemma holds for all @ if it holds for every œ of the form f dg, where f, g 
are C” functions on the neighborhood. Suppose, then, that w = f dg and let 
X, Y be C*”-vector fields. Then, evaluating both sides of the equation of the 
lemma separately, we obtain 


dw(X, Y) = dfa dg(X, Y) = df (X) dg(Y) — dg(X) df (Y) 
= (Xf (Yg) — (Xg)(¥f) 
and 
Xo(Y) — Yo(X) — o(l[X, Y]) = X(f dg(¥)) — ¥(fdg(X)) — fag (LX, Y]) 
= X(f (Yg)) — Y(f(Xg)) — f(XYg — YXq) 
= (Xf Yg) — (Xg Yf) 


after cancellation. This proves the lemma. (See Exercise 3 for a generaliza- 
tion.) I 


We can state Theorem 8.3 in a more elegant way if we introduce the 
concept of an ideal of A (M 


(8.5) Definition An ideal of /\(M) is a subspace .¥ which has the property 
that whenever pe. % and 8e A(M), then 8 ^ ge.F. 


(8.6) Example Let Z be a subspace of /\'(M), that is, a collection of 
one-forms closed under addition and multiplication by real numbers. Then 
the set (M) a Z = {0A o|pe F} is an ideal, the ideal generated by ¥. 
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Now suppose A is a distribution on M and suppose that # is the collec- 
tion of 1-forms ọ on M which vanish on A, that is, for each pe M, 
p(X) = 0 for all X,¢A,. f is a subspace; in fact, if fe C7(M) and ge f, 
then fo e f. We fave! 


(8.7) A is in involution if and only if df = {dọ | € J} is in the ideal gen- 
erated by J. 


The proof is left to the exercises. 


Exercises 


These exercises involve differential forms on a manifold M. A differential form g on M is 
closed if dp = 0 and exact if p = d0 for some form 8 on M. 


1. Show that the closed forms are a subalgebra (over R) of /\(M), which 
contains the collection of exact forms as an ideal. If F: M > N is C”, 
then show that closed forms are mapped to closed forms and exact forms 
to exact forms by F*. 

2. Let M = R? and determine which of the following are closed and which 
are exact: 

(a) ọ = yzdx + xzdy + xy dz; 
(b) p=xdx + x?y? dy + yz dz; 
(c) 8 = 2xy? dx Ady + z dy ^dz. 
3. Show that the following generalization of Lemma 8.4 is true for every 


ge /N (M 
a 
AOR is... X41) = 2A (DIX; lX e Rie Xr) 


+2 E ee E E E T, 
i<j 


(where the caret means that the term is omitted). 
4. Prove (8.7). 


A C® vector field X on a manifold M determines two operators on differential forms, ix: 
NM)> N 1M) and Ly: N(M) > A'M), as follows (recall, AM) = C*(M)). Tf 
ge N(M), then 

(iy 9X q,-.-,X,—1) = Q(X, Xq,....X,-1) ifr > 1, and iy = 0 on A\O(M); Lx@ = ixdg + 

d(iy@) ifr > 1, and Ly f = Xf for f e A\°(M). 


The following exercises, some of which are difficult, develop some essential properties of these 
operators and are excellent exercises in the calculus of differential forms (see Notes below for 
references). 
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5. Show that iy has the following properties: 
G) iy: \(M)> NM) is C%(M)-linear (fixed X), and X > ix is 
C*(M)-linear for fixed g. 
(ii) iy = 0, and if pe N(M), y e NYM), then 


iglo A W)=(ixg) A wWt+(-l@ ^ (ix). 


6. Show that Ly has the following properties: 
(i) Ly: AM) > AM) is R-linear (but not C”(M)-linear). 
(ii) Lyd =dLy. 
(iti) If pe /\"(M) and we /\‘(M), then 


Lxlo a W)=(Lyg) ^ Ww +o (A\(LyW). 


(iv) Lryp =df ^ iyo + fLyọ for f e C”(M). 
7. Let 6: W—M denote local one-parameter group action generated by X 
and we /\"(M), r > 0, define Lyo 


| 
(Ly®)p = lim t LOF Pout, p) = Pp]. 


Show that the two definitions of Ly ọ agree. 
8. If pe /\"(M) and X,,...,X, E€ X(M), then verify the following formula 
for Lyo: 


(LOX Xi- X) = X(@(X,,...,X)) E O(X,,..., Ly Xj,...,X,). 


i=1 


Notes 


A translation of Riemann’s famous Inaugural Address, given at Gottingen on the occasion 
of his being named a Privat dozent (an instructor whose fees depended on how many students 
came to his lectures) may be found in the book of Spivak [2], which, as mentioned earlier, is 
particularly valuable source material for those interested in further reading on Riemannian 
geometry. Volume II of this book is built around an explanation of the often obscure meaning of 
Riemann’s tecture. This material is also relevant to Chapters VII and VIII below. 

The author does not know when partitions of unity were first introduced. They are some- 
what out of place in this chapter, but were placed here since this is the first point at which they 
were needed. Not all of the applications, as is seen, have to do with tensors. The imbedding 
theorem given here is a very weak version of the Whitney imbedding theorem for which proofs 
of stronger versions can be found in several of the references, for example. Auslander and 
Mackenzie [1], Hirsch [1], Guillemin and Pollack [1], and Sternberg [1]. These proofs are 
within the reach of the reader at this point (only Exercise 10 of VI.1 is still needed) and would 
form a valuable supplement to the text, especially for those interested in differential topology. 
The same is true of various approximation theorems (especially those of Munkres [1, Sections 3 
and 4], which will be very useful for readers who wish to pursue further the consequences of 
differentiable structure alone (without further geometric structure such as a Riemannian metric. 
Lie group structure, and so forth). This is basic to modern differentiable topology (see Milnor [2] 
for example and Hirsch [1] or Brocker and Janich [1]). 
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Readers who desire a more complete and general approach to tensors and tensor fields will 
find it in many of the texts listed in the references. Both Kobayashi and Nomizu [1] and 
Sternberg [1] begin with this subject and could be studied with profit at this point, as could any 
book on multilinear algebra, e.g. Bourbaki. 

Exterior differential forms were first used extensively by Elie Cartan whose work has 
enormously influenced all modern differential geometry and Lie group theory. The calculus of 
/\(M), the exterior algebra on M, is his creation and he made many applications of it, too 
numerous to discuss here. Some idea of his contributions may be found in the memorial 
article by Chern and Chevalley [1]. Abraham and Marsden [1] and Godbillon [1] develop the 
calculus of exterior forms much further than we do here in order to make applications to 
dynamical systems and Hamiltonian and Lagrangian mechanics. The latter has an especially 
elegant and concise treatment of the operators ix, Ly, and d and their interrelations. Both books 
are recommended for readers who wish to master this calculus—which is essential to a deeper 
study of the subject. 
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The chapter begins with a brief review (without proofs) of properties of multiple integrals 
over domains of R”. In the next section this theory is extended to C manifolds. The extension 
to manifolds involves two steps: first, we define integrals over the entire manifold M of suitable 
exterior n-forms and second, for those M which have a predetermined volume element (for 
example, Riemannian manifolds), integrals of functions over domains are defined. All the 
standard properties of integrals follow readily from the corresponding facts in the Euclidean 
case. As an illustration of the use of integration on manifolds an application is made to compact 
Lie groups. It is shown that by averaging a left-invariant Riemannian metric on a compact 
group one may obtain a bi-invariant Riemannian metric. With the same techniques—due to 
Weyl—it is shown that any representation of a compact group as a matrix group acting ona 
vector space leaves invariant some inner product on that vector space, from which it follows 
that any invariant subspace has a complementary invariant subspace. 

Following this, in Section 4, the concept of manifold with boundary is introduced. This 
generalizes the line interval, unit disk, and similar simple manifold-like objects needed if one is 
to discuss “pasting” together of manifolds—as in Chapter I—or differentiable homotopy. 
However, our immediate interest is in a statement and proof of Stokes’s theorem, using mani- 
folds with boundary as domains of integration. This theorem, a generalization of the fundamen- 
tal theorem of calculus, embodies Green’s theorem on the plane, the divergence theorem, and 
Stokes’s theorem of advanced calculus in a unified form. If M is a manifold with boundary ôM 
and w ann — 1 form on M, dim M = n, then the theorem asserts the equality of the integral of 
w over OM (with suitable orienttation) and dw over M. This theorem, proved in Section 5, 
concludes our development of the basic techniques of integration on manifolds. 
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The remainder of the chapter is devoted to various applications of the techniques accu- 
mulated thus far to the (algebraic) topology of manifolds. In order to introduce these ideas the 
concept of homotopy or deformation of mappings is introduced. The simplest case is the 
deformation over a manifold M of a loop based at be M, that is, a continuous image of 
0 < t < 1 with both endpoints at b. In general, not all loops can be deformed to one another on 
M (consider M = T?, for example). The classes of those which can be deformed to one another, 
with a suitable product, form a group—the Poincaré fundamental group of M. Although quite 
diverse in general, these groups are isomorphic for two homeomorphic manifolds, furnishing 
the simplest example of an algebraic object which measures topological invariants of a space. 

Following this, the de Rham groups are defined. They are the groups of closed k-forms 
modulo exact k-forms, and are used here, together with integration theory, to prove some 
classical theorems of topology (in the spirit of Milnor [2]). In particular, a proof is given of the 
Brouwer fixed point theorem and of the nonexistence of nowhere vanishing vector fields on 
even-dimensional spheres. Finally these techniques are once more applied to compact Lie 
groups to obtain—by way of example only—a few interesting scraps of information about their 
topology. 


1 Integration in R” Domains of Integration 


As might be expected, we begin with integration in Euclidean space and 
carry over to manifolds the basic ideas developed there, just as we have done 
for differential calculus in earlier chapters. The basic facts that we will need 
concerning integrals on various subsets of R” will be assumed known. We 
shall enumerate them here, and they may be found, proved in detail, in the 
references, for example, Apostol[1] or Spivak [1]. We need only the 
Riemann integral. However, we must admit domains of integration and 
functions which are slightly more complicated than those found in elemen- 
tary calculus. This is natural since a diffeomorphism, or change of coordin- 
ates, badly distorts even a simple region such as a cube. First, we proceed to 
define the domains of integration which we allow. 

We shall say that a subset A of R” has (n-dimensional) Jordan content 
zero, c(A) = 0, if for any £ > 0, there exists a finite collection of cubes 
Cy,...,C, which cover A and the sum of whose volumes is less than e, 
y= vol C; < e. If A satisfies a similar condition with the less rigid require- 
ment that for ¢ > 0 there exists a countable set of cubes covering A with 

x; vol C; < e, then we say that A has Lebesgue measure zero, m(A) = 0. 
These are not equivalent concepts. It is easy to see, for example, that the 
subset of rational numbers in R has measure zero but not content zero. 
However, c(A) = 0 implies m(A) = 0 and, if A is compact, the converse also 
holds. More generally m(A) = 0 if and only if A is a countable union of sets 
of content zero. 


(1.1) Definition A bounded subset D of R” is said to be a domain of 
integration if its boundary Bd D has content zero. A function f on R” is said 
to be almost continuous if the set of points at which it fails to be continuous 
has content zero. 
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The most obvious example of a domain of integration is a cube, or an 
n-ball. The usual domains of integration in R? or R?, bounded by piecewise 
differentiable curves or surfaces, are also examples. 


(1.2) Theorem Let D be a domain of integration in R" and let f be a real- 
valued function on D. Suppose that f is bounded and almost continuous on D. 
Then the Riemann integral |p f dv exists. 


We shall refer to a function with these properties as integrable on D. To 
say that the integral exists means, of course, that it is a limit of approximat- 
ing sums in the usual sense. The proof is essentially the same as that which is 
at least outlined in every calculus book. It is a very useful exercise to carry it 
out in detail and then to verify the following properties which are relatively 
easy consequences of the reasoning used in proving existence. 


Basic Properties of the Riemann Integral 


Let D, D,, and D, denote domains of integration in R” and f, g bounded 
almost continuous functions on R”. It is not too difficult to show that D, the 
closure of D, and D, the interior of D, are also domains of integration as are 
D, ov Da, Di ^D, and D, — D}. We have the following standard 


properties: 


(1.3) If c(D) = 0, then Í fdv=0. 


4) | fdv =f fdv+ | fw- fd. 
“Di u D3 Di D2 “Di nD2 
(1.5) | (af + bg) dv = a | fdv + b{g dv forall a,beR. 
Jp . 
(1.6) Suppose f > 0 on D and c(D) #0. Then | fdv=0. Equality 
D 


holds if and only if f = 0 at every point at which it is continuous. 


Recall that the characteristic function k, of a subset A of a space X is 
defined to be identically equal to +1 on A and 0 outside A, that is, on the 
complement of A. Therefore k , is bounded and its discontinuities are exactly 
the set of boundary points of A, Bd A. In particular, if D is a domain of 
integration, we have c(Bd D) = 0 so that kp is integrable. If D’ is a domain of 
integration which contains D, then fp. kpf dv = fp fdv. Thus if f on R” is 
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bounded, has compact support, and is almost continuous, then we define 
fae f dv unambiguously by fgn fdv = fp fdv, using any domain of integra- 
tion D such that D > supp f: 


(1.7) Definition Let D be any domain of integration. Then we define the 
volume of D, vol D, by 


vol D = [ kp dv = | kp dv. 
D 


| Rn 


The following property is an easy consequence of the definitions: 


(1.8) (in s) vol D < | fdv< (sap) vol D. 
D D D 
When D is connected and f is continuous, we obtain the mean value property 
| fd = f(a) vol D 
“D 
for some point ae D. 


The following theorem, a special case of Fubini’s theorem, is more 
difficult to prove than the above properties, although we need only the 
simplified version below. It justifies the usual evaluation of multiple integrals 
by repeated single integrations of functions of one variable (iterated 
integrals). 


(1.9) Theorem If f is a continuous function on the domain of integration 
D = {xe R": a < xê < bf, i = 1, ..., n}, then 


| fasl f Sar a de 


“an “al 
the expression on the right denoting repeated single integrations. 
We shall need one further theorem from advanced calculus, the principle 
which allows us to change the variable, or variables, of integration. In the 


case of a function of one variable this is the standard and indispensable 
technique of substitution, which allows us to write 


Tro) dy = [ ree ® dx, 


where y = g(x), a < x < b, with c = g(a) and d = g(b). Unless we assume 
dy/dx > 0, we encounter problems in this formula. If this condition is 
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satisfied, then y = g(x) may be considered as a change of variable, or as a 
diffeomorphism of [a, b] onto [c, d]. The general multiple-integral statement 
is less familiar in elementary calculus, although it occurs, for example, in the 
passage from Cartesian to polar, cylindrical, or spherical coordinates. It is 
proved in most advanced calculus courses for n = 2 or 3 at least, so we will 
assume it without proof. This is the most difficult of the standard theorems 
of integral calculus which we will expect of the reader. It will be essential to 
us in extending Riemann integration to manifolds, since we clearly must 
know the effect of change of coordinates on the value of an integral. 

Let us denote by G: U > U’ a diffeomorphism of U c R” onto U' c R" 
and by AG the determinant of its Jacobian. We suppose G to be given by 
coordinate functions y = y'(x), i= 1, ..., n. Then AG = det(@y'/éx’). A 
function f’ on U’ determines a function f = f'o G on U and we have the 
following relation of their integrals. 


(1.10) Theorem (Change of Variables) Suppose Dc U and 
D' = G(D) = U' are domains of integration and that f’ is integrable on D'. Let 
f=f'°G, that is, f(x", ..., x") = f'(g"(x), ..., g"(x)). Then f is integrable on 
D and 


| oa = | £6) |AG| dv = | s |AG] dv. 


(1.11) Example Let 
D = {p, 0, )|0 <a < p <b, 0 <8 < n/2, n/4< o < 2/2} 


and D’ be the first quadrant region of xyz-space between the spheres with 
center at the origin and radii a and b, and outside the inverted cone 
z? = x? + y? (Fig. VI.1). Let G be given by the coordinate functions 


x = p sin ọ cos 0, y = p sin ọ sin 0, Zz = p cos Q. 


Figure VI.1 
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Given f'(x, y, z) = x? + y? + 2’, then f= f'o G is 
f (0, 0, 9) = f'(p sin ọ cos 0, p sin ọ sin 0, p cos p) = p? 
and A = p° sing so that 


| (x? + y? + 2?) dx dy dz = | p? |p? sin | dp dọ do. 
m “D 


If we are to extend these ideas to C” manifolds, we need to know what 
happens to domains of integration under diffeomorphisms. A cube in R”, for 
example, is such a domain since its boundary—the faces—have zero (n- 
dimensional) content. Does it remain a domain of integration after a diffeo- 
morphism? If we recall that there are continuous images of an interval, 
0 <t < 1, which fill a square, this question seems less trivial; it is possible 
that the image of the boundary of a cube could become very large under a 
differentiable mapping. The following lemma will thus be crucial 


(1.12) Lemma Let A be a compact subset of R” and let F: A R", n < m be 
a C! mapping. Then F(A) has content zero if n< m or if n= m and A has 
content zero. 


Proof By definition F must be C on an open set U > A and since A is 
compact, we may choose an open set V > A such that V is a compact subset 
of U. Let K = sup,.;|df'/@x/|, a bound of the derivatives on V of the 
coordinate functions of the map F. Choose 6,, 0 < 6, < 1, so that every cube 
of side 5, whose center is in A lies inside V. By the mean value theorem 
(Theorem II.2.2), we have 


F(x) — F(a)|| < (nm)? K|]x — al 


for any x in a cube of side 6, and center ae A.If5, > 6 > 0, then a cube C of 
side ô and center ae A must map into a cube C’ of center F(a) and side 
length less than or equal to (nm)'!?K6. Thus we see that F(C) lies in a cube 
C’ whose volume satisfies 


vol C’ < ((nm)'/?K6)" = (nmy"/*K™5"~ "6" < kô” " vol C, 


where k = K”(nm)™!? is independent of ae A and vol C = 6". Assume m =n 
and c(A) = 0. From this it follows at once that given any ¢ > 0, we may cover 
F(A) with a finite number of cubes C/,,..., C; whose total volume is less than 
e. We need only cover A with cubes C,,..., C, whose total volume is less than 
e/k and whose side is less than 6,. This shows that the content of F(A) is zero 
in this case. The case n < mis left to the reader. For an extension see Exercise 
10. I 


Using this lemma it is easy to extend the notions of zero content and zero 
measure to subsets of any C” manifold M of dimension n. 
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(1.13) Definition A subset A c M is said to have content zero, c(A) = 0, if 
it is contained in the union of a finite number of compact subsets A4;, 
AcA,vU--:UA,, each of which lies in a coordinate neighborhood U;, 9; 
such that c(y,(A,)) = 0 in R", i= 1,...,s. An arbitrary subset B c M is said 
to have measure zero, m(B) = 0, if B is the union of a countable collection of 
subsets B = |), B; such that each B; has content zero. 

In the light of this definition we may state, as a corollary to the lemma, 
the following facts about sets on a manifold: 


(1.14) Corollary If A c M has content (measure) zero and F: M >N is a 
C! map with dim M s dim N, then F(A) has content (measure) zero. In 
particular, this holds if F is a diffeomorphism. 


Proof This is an obvious application of Lemma 1.12 to Definition 1.13. 
If dim M < dim N, a stronger result holds (Exercise 10). | 


We define domain of integration in an arbitrary manifold precisely as we 
did for R": D c M is a domain of integration if D is relatively compact and 
the boundary of D has content zero, c(Bd D) = 0. [Note that in R” “rela- 
tively compact” is equivalent to “ bounded.”] We have analogous properties 
to those of domains of integration in R". 


(1.15) Theorem If D is a domain of integration in M, so are its closure and 
its interior. Finite unions and intersections of domains of integration are do- 
mains of integration and the image of a domain of integration under a diffeomor- 
phism is a domain of integration. 


Proof These are all immediate consequences of Definition 1.13 and of 
the corresponding statements for subsets of content zero and domains of 
integration in R”. For the last statement we must note that a diffeomorphism 
takes boundary points to boundary points. | 


Exercises 

l. Prove that a set of measure zero cannot contain any open set. 

2. Prove Theorem 1.2 for a general domain of integration in R? assuming 
that the integrand f is: (a) continuous on D, and then (b) continuous 
except for a set of content zero. 

3. Prove that finite unions and intersections of domains of integration are 
domains of integration. 

4. If D is a domain of integration show that D, its closure, and D, its 
interior, are domains of integration and that if fis integrable on D, then 


| fdv={| fdv= | fav. 
D “D "D 
5. Verify properties (1.3)-(1.6) of the Riemann integral. 
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6. Let D bea domain of integration in R" and a < t < ban open interval of 
R. Suppose f(x, t) is continuous on D x (a, b) and is of class C’ in t. 
Then prove that g(t) = fp f(x, t)dv is of class C' on (a,b) and 
dg/dt = Íp (Of/dt) dv. 

7. Suppose f, g are both integrable on a domain of integration D and that 
f> g. Then show that fp fdv > fpg dv. 

8. ae the change of variables theorem for a linear mapping G: R? > R°. 

9. If {f,(x)} is a sequence of continuous functions on D (a domain of 
integration in R”) converging uniformly to g(x), show that 
lim,» fo Sax) dv = fo glx) dv 

10. Prove that if F: M >N is a C! mapping and dim N > dim M, then 
F(M) c N has measure zero. 


2 A Generalization to Manifolds 


In this section we carry over to arbitrary oriented manifolds the concept 
of integral reviewed in the previous section. We first define the integral of an 
n-form w on the oriented manifold M of dimension n, denoting it by fy œ. It 
is only when we specialize to a more restricted class of manifolds, say 
Riemannian manifolds, that we are able to give meaning to the integral of a 
function on M over a domain D of integration in M and thus obtain a 
complete generalization of integrals on R". This is not surprising since 
definition of the Riemann integral in R” makes important use of volume, 
a metric concept, which is not defined on a general differentiable manifold. 

Suppose that M is an oriented manifold and dim M =n. By 
Definition V.7.5 this means that there is a C® n-form Q on M which is not 
zero at any point of M. It is called a volume element; any other n-form w is 
given by œ = fQ, where f is a function on M. Since Q is C”, œ will have the 
differentiability class of f. We use this to make the following definitions. 


(2.1) Definition A function fon M is integrable if it is bounded, has com- 
pact support (vanishes outside a compact set), and is almost continuous 
(that is, continuous except possibly on a set of content zero). An n-form w on 
M, in the very general sense of a function assigning to each pe M an element 
w, of /\\"(T,(M)), is said to be integrable if w = fQ, where fis an integrable 
function. [Note: We are not requiring w to be C® or even C4] 


We remark that the definition of integrable n-form does not depend on 
the particular Q we use. Any other © giving the orientation is of the form 
Q = gQ, where g is a positive C® function on M; thus fQ = f/gQ. If f has 
compact support, is bounded, and is almost continuous, then the same will 
be true of f/g. We will denote by /\$(M) the set of integrable n-forms. Like 
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/\"(M), it is a vector space over R; moreover, it is closed under multiplica- 
tion by continuous or integrable functions on M. 

We shall refer to a subset Q = M as a cube of M if it lies in the domain of 
an associated, oriented, coordinate neighborhood U, g and g(Q) = C = 
{xe R"|0 < xf < 1,i = 1,..., n}, the unit cube of R”. Thus a cube is a com- 
pact set and is coordinatized in a definite way. We first define the integral 
over M of any we /\3(M) whose support lies interior to some cube Q. Let 
U, ọ be the coordinate neighborhood associated with Q and suppose 


gp '*(w) = f(x) dx! aa dx" 


represents œ in the local coordinates. Then f is bounded and almost contin- 
uous on C so that fe fdv is defined. We define 


[o = | fd. 


We must show that the value of this integral is independent of the partic- 
ular cube we have used. Suppose Q’ is another cube containing supp œ and 
let U’, ọ' be the associated coordinate neighborhood. We denote the local 
coordinates for this neighborhood by y',..., y” and suppose that 


o MO) =f) dy! andy 


represents w on g’(U’). According to the rules for change of components of 
an n-form, we have 


F(x) = f'(G(x)) AG, 


where G = g'° gp ': p(UNU’) > g (Un U’)and AG is the determinant of 
the Jacobian matrix of this diffeomorphism; AG is positive since these are 
oriented neighborhoods (see Section V.7). On the other hand, since Q, Q’ are 
domains of integration, so are Q ^ Q’ and its images D = (Q ^ Q’) and 
D' = o'(Q ^ Q’) which lie in the unit cube of the x-coordinate space and the 
y-coordinate space, respectively. Moreover, suppw- QQ’, so 
supp f = D and supp f’ = D’. Therefore 


j £69 dv = | £69 dv and | £0) dv = [fo dv’. 


According to the change of variable theorem 1.10, and since D’ = G(D), we 
have 


l; f(y) dv = [ f'(G(x)) |AG | dv. 


However, AG > 0 so that |AG| = AG and the integral on the right must 
then be equal to fp f(x) dv by the above formula for change of components. 
This shows that fm œ is uniquely determined for every integrable œ which 
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vanishes outside of some cube. We note, in particular, the following linearity 
property: If @,, œ vanish outside a cube Q and aj, a, are real numbers, then 


aw, + a20 = 4 | o, + az | %2. 
M “M M 


Now suppose that œ is an arbitrary integrable n-form. We will define 
fu œin this case by representing w as the sum of a finite number of forms of 
the special type above and adding their integrals to obtain Su w. More 
precisely, let K = supp w and choose a finite covering of K by the interiors 
O,,...,0, of cubes Q;,...,Q, associated with coordinate neighborhoods 
Ui, Q1- --, Up, Øp, respectively. The open sets M — K, 0,, ..., Q, cover M. 
Take a partition of unity { f,} subordinate to this covering. By compactness of 
K and the local finiteness of {supp f;}, there is an s such that for i> s, 
K ^ supp f, = Ø, so that f; œ = 0. Since J, f; = 1, we have then 


o=frotr +f 
and we define 


|e = | fie qe rth | be. 


Each of the integrals on the right is defined since the integrand has its 
support on the interior @; of a cube Q;. 

The value of this integral does not depend on the choice of covering or 
the functions {f}. Let Q',..., Q, be another set of cubes whose interiors 
cover K and choose a partition of unity {g,} subordinate to M — K, 
O',..., Ôp. Again, g,@ = 0 if k is greater than some r. Then Vin Soe = 
Yi fi Ve Gx = 1 and for fixed k, 1 < k < r, we have supp fig, = Q; for some l. 
Therefore 


| no = | hno poet | foe 


by the linearity of the integral with respect to forms with support in the same 
cube. Therefore, if we compute fu w using this second covering by cubes, we 
have 


|o=} | awo} } | fioo. 

*M k=1 'M k=1 i=1°M 
However, by a symmetric argument the sum on the right is also equal to 
S21 fu f,@, hence both choices assign the same value to fu œ. This com- 
pletes the definition of the integral over M of integrable n-forms, we now list 
some of its properties. 
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(2.2) Theorem The process just defined assigns to each integrable n-form œw 
on an oriented manifold M a real number fy œ. We have the following 
properties: 


(i) If —M denotes the same underlying manifold with opposite orienta- 
tion, then f-y © = — fu ©. 
(ii) The mapping œ > fu œ is an R-linear mapping on /\§(M), that is: 


| 4101 + 4202 = a, |o, +a | œz, 
IM 4 ! 


aa ER and œ, œE /\3(M). 


(iii) If Q is a nowhere vanishing n-form giving the orientation of M and 
w = gQ with g > 0, then fu gQ > 0 and equality holds if and only if g = Q 
wherever it is continuous. 

(iv) If F: M, > M, is a diffeomorphism and we /\3(M;), then 


S vee He 


with sign depending on whether F preserves or reverses orientation. 


Proof Because of the definition of the integral, we need to verify these 
properties only for forms œ whose support lies in a cube Q associated with 
the oriented coordinate neighborhood U, g and coordinates x!,..., x”. 
Then by definition, fy «= fe f(x) dv, where g~!*(w) = f(x) dx! 

- A dx" If orientation of M is reserved, in evaluating | œ we replace U, o 
by, say, U' = U, g’ with g’ such that the Jacobian of g’ ° 7t has negative 
determinant, e.g., @ followed by interchange of first and second variables. 
This changes the sign of f since f is the component of œ in the local 
coordinates, hence it changes the sign of the integral. Property (ii) was 
previously noted; it is a consequence of the corresponding property for the 
Riemann integral on R". Property (iii) is clear once we note that in (oriented) 
local coordinates g~'*Q = p(x) dx’ a +- a dx", p(x) >0, so that fy 
gQ = fe g(x)p(x) dv. Since g(x)p(x) > 0, and vanishes exactly where g(x) 
vanishes, the assertion follows from the corresponding property in R". 
Finally, suppose F: M, > M, is a diffeomorphism which preserves orienta- 
tion. If œ on M, has support in a cube Q associated with the coordinate 
neighborhood U, g, then Q’ = F~'(Q) is a cube on M, associated with 
U' = F~'(U) and g' = ọ o F~! Using this cube, which contains the support 
of F*w, we have precisely the same expression f(x) dx! a --- A dx" for 
both w and F*w in local coordinates, hence the same integral |, f dv gives the 
value of both fy, œ and fu, F*w. If F does not preserve orientation, the 
equation Jy, F*w = — fm, œ follows from the orientation-preserving case 
and property (i). i 
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(2.3) Remark We note that a special case of the definition above, namely 
M = R”, defines 

| PP ieee ROE ORES 

R" 

for any bounded function f on R” which has compact support and is almost 
continuous. It is left as an exercise to show that if supp f = D, a domain of 
integration, then 


| f(x)dx! a adx" =| £69) dv, 


“Rn 


the usual Riemann integral. 


Integration on Riemannian Manifolds 


Thus far we have not defined integrals of functions, but rather integrals of 
n-forms. Even a cursory examination of the definition of integrals of func- 
tions over domains of R” used in advanced calculus shows that it assumes 
that we are able to assign a volume to certain classes of subsets of R”, say, 
cubes and rectangular parallelepipeds. In fact only this one ingredient is 
lacking; if M has a well-determined volume element, then we are able to pass 
from the definition above to integrals of functions on an oriented manifold 
M. A volume element is, by definition, a nowhere vanishing n-form Q on M 
which is in that class which determines the orientation. On an arbitrary 
oriented manifold there is such a form Q but it is determined only to within a 
multiple by a positive C” function. This is not enough to define volumes ; we 
must have a unique Q given, say, by the structure of M. One case in which 
this occurs, according to Theorem V.7.7, is on an oriented Riemannian man- 
ifold M. In this case there is a unique Q whose value on any orthonormal 
frame is + 1. We shall always use this Q on the Riemannian manifold and in 
the remainder of this section we shall discuss only the Riemannian case. 
Then, using Q and the characteristic function kp of a domain of integration 
D we are able to parallel the theory for R”. 


(2.4) Definition If D is a domain of integration on an oriented Rieman- 
nian manifold M and k, is the characteristic function of D, we define the 
volume of D, denoted by vol D, by vol D = fy kpQ. If f is any bounded, 
almost continuous function on M, we define the integral of f over D, denoted 
fof, by Sof =SufkpQ. When M is compact, we may take D= M and 
obtain vol M = fQ and Jy f = fu fQ. 


These integrals are defined since kp is continuous except on Bd D, which 
has content zero, and kp has compact support. 
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(2.5) Lemma With these definitions the integral of f on a domain of integra- 
tion on M satisfies properties (1.3)-(1.6) of the Riemann integral on R". It is 
equal to the Riemann integral when M = R" (with its standard metric). 


This is a consequence of the definitions and of the corresponding proper- 
ties (1.3)-(1.6) of the Riemann integral. One merely needs to demonstrate— 
by choosing a covering of D by the interiors of cubes and taking a 
corresponding partition of unity as in the definition of fy @—that it is 
possible to reduce the proof to verifying each property for the special case in 
which œ = fQ has its support in a single cube. In this case the properties 
coincide with the properties of the integral on R”. For the last statement we 
use Remark 2.3. 

We recall that in local coordinates U,@g with coordinate frames 
E,,...,£, and Riemannian metric tensor ®(X, Y), the matrix components 
®(E;, E;) on U are customarily denoted by g;;, i j = 1,..., n, with the same 
symbols g;; frequently used to denote g,(p) = ®,(E;,.£;,) and 
gij(x', ..., xX") = §,,(p(p)), that is, the components considered as functions 
on U c M or as the corresponding functions on g(U) c R”. In Section V.7 
we found that the local expression for Q on an oriented neighborhood was 


po Q = /gdx'n---ndx", g = det(g;;). 
We use this in the example below. 


(2.6) Example Let M be a surface in R? with the Riemannian metric 
induced by the standard metric of R° and let U, ọ be a coordinate neighbor- 
hood with coordinates (u, v). Suppose g(U) = W an open subset of the 
uv-plane. Let F = ~~‘ so that F: W > M has image U, and let F(u, v) = 
(f(u, v), g(u, v), h(u, v)) be the C*-coordinate functions for the mapping (see 
Fig. VI.2). As in Example IV.1.10 the coordinate frames E,, E, on U are 


o) Se aa ôhð 
ðu) ~ ðuðx ` ðuðy ` dudz’ 


rre) Se aa aho 
2 3v Ovdx ðvôx vêz’ 


E =F, 


and hence 


tao) = (2) + (SE) + (BY = E ead 


_ Of of . ag6g hêh 7 7 
912(u, v) Gu év + ĉu ôv + ĉu dv aa (E, E3) ~ (E2 > E,) =, J2i(u, v), 


a ee 
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x 
(a) (b) 


Figure VI.2 


These are denoted E, F, G, respectively, in the literature of classical differen- 
tial geometry and we have then 


7 '*Q = F*O = (911922 — 972)! dua dv = (EG — F?)!/2 du ^ dv. 


If D is a domain of integration on M such that D c U, and his an integrable 
function on D, then 


{ h= | hQ =| h(u, v)(EG — F?)'? dua dv 


D (D) 
= | h(u, v)(EG — F)*? du dv. 
g(D) 


Suppose, for example, that ¢ is the (diffeomorphic) projection of an open 
set U of M onto an open set W of the xy-plane, which we identify with the 
parameter plane. In this case F: W > U is given by F(x, y) = (x, y, f(x, y)) 
The graph of z = f (x, y) lying over W is the subset U of M, Fig. VI.3. The 


z=fix, y) 


Y 


PDF pdfFactory Pro www.fineprint.cn 


2 A GENERALIZATION TO MANIFOLDS 243 


coordinate frames are E, = 6/6x + f,6/6z and E, = é/éy + f; O/éz, so 
E=1 +f}, F =f.f,,G =1+?. Hence 


F*Q = (EG — F?)!? dx ady = (1 + f2 + f2)!? dx ^ dy. 


If D c U is a domain of integration and A c W its projection to the xy- 
plane, then for any integrable function h on M we have 


[ h= f h(x, y, z(1 + f2 + f?)'? dx dy. 
“D “A 


When h = 1, the value of this integral is the area of D [= vol D]. 

If, for example M = S?, the unit sphere, let U be the upper hemisphere 
and D=U. Then A=W={(x,y)|x?+y?< 1} and F(x,y)= 
(x, y, (1 — x? — y*)"?). The area of U is 


[ Q= [ (1 — x? — y?)7!? dx a dy 
fu Z 


.+1 (= y3Ż 
= (1 — x? — y?y Y? dx dy = 2r. 
*=1 *=(1-y3Ż 


(2.7) Remark In practice (or for theoretical purposes) one might hope 
that a compact manifold M could be covered by a finite number of domains 
of integration D,, ..., D, with the properties: (i) c(D; ^ Dj) =0, i#j, 
i j= 1,..., s, and (ii) each D; lies in a coordinate neighborhood U;, 9;. 
Then, using the fact that 


|f Trj 


it would be possible to evaluate each integral on the right separately as an 
integral on ọ;(D;) c R", 


| J=] Seada ade =] fa dr, 
“Di “@pi(Di) “gi(Di) 

where f (x) denotes the expression for fin local coordinates and g = det(g;;) 

as in the remark preceding Example 2.6. 


In fact, it can be shown that any differentiable manifold M (compact or 
not) can be covered with a collection of domains of integration D,, D3, ..., 
each the diffeomorphic image of a simplex (for n = 2 a triangle, for n = 3 a 
tetrahedron, and so on). Moreover these domains intersect in sets of content 
zero. [This is part of a theorem which asserts that any C” manifold is 
triangulable, An example is illustrated in Fig. VI.4.] When M is compact the 
number of D; is finite. This is not a complete description of a triangulation; 
for more details see Singer and Thorpe [1]. However, it shows that for both 
practical and theoretical purposes a technique of evaluation of Jy for fy Q 
is available. 
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Figure VI.4 
Triangulated manifolds. 


Exercises 


1. Interpret Q and volume for a curve in R?, that is, a one-dimensional 


manifold. 
2. Show that if D is a domain of integration on a manifold, then 


[ f=) Fai 


3. Let D be a domain of integration in R” and F: D > M be a continuous 
mapping into an n-manifold M. Suppose F is a C' diffeomorphism on D 
the interior of D, piecewise C! on ôD, and that œ is an integrable n- -form 
on M. Then show that F(D) is a domain of integration and that 
frm © = fp F*@. [We do not require F to be one-to-one on the 
boundary of D.] 

4. Find Q (for the induced Riemannian metric) on the torus T? in R? 
obtained by rotating a circle of radius a and center at (b, 0,0), 
b > a> 0, around the x3-axis. Use this to determine vol(T”). (Use 
Exercise 3.) 

5. Using Remark 2.7 integrate on M = S?, the unit sphere of R°, the func- 
tion f giving the distance of a point on M from the plane x = —l. 
Argue that we may use as D, and D, the upper and lower hemispheres. 
[Hint: Use Exercise 2.] 


3 Integration on Lie Groups 


One striking illustration of the uses to which integration on manifolds 
can be put arises when the manifold considered is a Lie group G. Although 
the most interesting case is a compact Lie group, for the present we allow G 
to be an arbitrary Lie group of dimension n. We shall need some simple 
observations concerning left and right translations and inner automor- 
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phisms of G. Given a, b € G, we denote by La, Rp, and 1, = L, > R,-1 left and 
right translation and the inner automorphism, I,(x) = axa™', of G, respec- 
tively. These are C® mappings with inverses L;' = L,-1, RJ} = R,-1, and 
1; = l,-ı. Hence they are diffeomorphisms and as such induce R-linear 
mappings of ¥(G)—the C®-vector fields on G—onto itself which preserve 
the bracket operation (Corollary IV.7.10). However, on G our main interest 
is in the subspace g of ¥(G) consisting of all left-invariant vector fields on G. 
As we have seen g is a Lie algebra, the Lie algebra of G, with respect to the 
product [X, Y]. Given a, b € G, we note the fact that the left and right trans- 
lations L, and R, commute—this is just the associative law a(xb) = (ax)b. 
From this we deduce that if X € g, then R,, X € g. Also 


Lg (Roy X) = Roa (Loa X) = R,, X. 


Similarly, Ilay X = Lay Ra-14 X = Ra-14 X € g; thus /,,:g > g. Because I,, 
is both a linear mapping and preserves the product, that is, J,,[X, Y] = 
[ax X, Ia Y], it is an automorphism of the Lie algebra g. Finally, note that 
Ia = lao 1, so that Ign, = lay ° Tog by the chain rule. Putting these facts 
together and adopting the notation Ad g for /,, , g any element of G, we have 
proved most of the following statement. 


(3.1) The mapping of G into the group of all automorphisms of g defined by 
g — Ad g is a homomorphism. Let Gl(g) denote the group of all nonsingular 
linear transformations of g as a vector space. Then Ad: G > Gl(g) is C*. 


It is only the last statement which requires proof and interpretation. In 
general, if V is a finite-dimensional vector space over R, then the group Gi(V) 
of all nonsingular linear transformations of V onto V is isomorphic to 
Gl(n, R), n = dim V. The isomorphism depends on the choice of a basis 
e;,..., €n of V and is given by letting A € GI(V) correspond to the matrix («;;) 
defined by A(e,) = ) 7-1 ae, j = 1,...,n. We take the topology and C” 
structure on GI(V) obtained by identifying it with the Lie group Gi(n, R). It 
may be shown (Exercise 3) that this C” structure is independent of the 
choice of basis. Therefore, if we choose a basis X,, ..., X,, of g and let (a,;(g)) 
denote the matrix corresponding in this way to Ad g, the last statement 
asserts that g++ (a,,(g)) is a C” mapping. Note that I,(e) = e, hence 
1,4: T(G) > T.(G). Because g may be naturally identified with T,(G) by 
identifying each X € g with its value X, at e, we may think of Ad g asa linear 
transformation on g—the left-invariant vector fields—or on T,(G) where, of 
course, it coincides with that induced by J, according to the definition. In 
particular, if we use the latter point of view, the matrix («;;(g)) is a submatrix 
of the Jacobian matrix evaluated at (g, e) of the C” mapping of G x G > G 
defined by (g, x) gxg~' = I(x). Hence g++ (a;,(g)) is C”. More generally, 
we make the following definitions. 
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(3.2) Definition A representation of a Lie group G ona vector space V isa 
Lie group homomorphism of G into the group Gi( V) of nonsingular linear 
transformations of V onto V. Its degree (dimension) is the dimension of V. A 
matrix representation of G of degree n is a Lie group homomorphism of G 
into Gl(n, R). The representation g++ Ad g is called the adjoint representa- 
tion of G. 


We remark again that we interpret Ad g both as a linear mapping on g, 
the space of invariant vector fields, and on T,(G), the tangent space at the 
identity. This is by virtue of the identification of g with T,(G). In either case 
Ad g is induced by the diffeomorphism I,(x) = gxg™ 1 of G onto G. 


Many questions about Lie groups may be reduced to questions about the 
adjoint representation of the group. Some examples are given in the exer- 
cises; we give another below. First we shall need a definition and a lemma. 


(3.3) Definition A covariant tensor field ® of order r on G is left- (right-) 
invariant if L¥®,, = ®, (or R7®,, = ®,, respectively). It is bi-invariant if it is 
both left- and right-invariant. 


We remark that any left- (or right-) invariant covariant tensor field 
Qe 7"(G) is necessarily C”. If X,,..., X, is a basis of C® left- (or right-) 
invariant vector fields, then ®(X;,,..., X;,) is constant—hence C”—on G for 
any 1 <ij;,...,i, < n. Therefore the components of ® with respect to a 
C®-frame field are C”, and ® is thus C°”. 


(3.4) Lemma Let ®, be a covariant tensor of order r on the tangent space 
T,(G) at the identity. Then there is a unique left-invariant tensor field and a 
unique right-invariant tensor field coinciding at e with ®,. These two agree 
everywhere on G, that is, ®, determines a bi-invariant tensor field, if and only if 


Ad (g)*®, = ®, for all geG. 


Proof Let ®, be given on T,(G). For each ge G we have a unique left 
translation L,:G—+G which takes e to g. Define ®e TG) by 
®, = Ly.. Then LI®,, = L4(L¥-1-1®,) = Lž o L1 ° Lg-1®, = Lj-1®,. 
However, this is just ®,, so we see that ® is left-invariant. Similar arguments 
show that R*_,®, is a right-invariant tensor field. Note that R} = 
(Rs ° Ra)* = Roe RŠ. 

If ® is bi-invariant, then Ad(g)*®, = L* > R¥-ı ©, = ®,. Conversely, if 
this relation holds, then 


L#,®, = Lı o Lž o R410, = RF-.®, 


g 


so that the left- and right-invariant tensor fields determined by ®, agree at 
every ge G. It is immediate that an invariant field must be determined by its 
value at any one element, say e, of G. I 
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(3.5) Corollary Every Lie group has a left-invariant Riemannian metric and 
a left-invariant volume element. In particular every Lie group is orientable. 


Proof We take any inner product ®, (a positive definite, symmetric 
covariant tensor of order 2) on T,(G) and apply Lemma 3.4 to ®, and to the 
volume element Q, that ®,, with a choice of orientation of T,(G), determines 
in order to obtain a left-invariant Riemannian metric ® and volume element 
Q. I 


In case G is compact we are able to say even more, as the next theorem 
and its corollary show. The corollary will make use of integration; to sim- 
plify the treatment we shall suppose G is connected (see Exercise 5). 


(3.6) Theorem An oriented, compact, connected Lie group G has a unique 
bi-invariant volume element Q such that vol G = 1. 


Proof Let Q be a left-invariant volume element on G. We claim that Q 
is necessarily right-invariant also. In order to prove this it is enough to show 
that Ad(g)*Q, = Q, for all ge G. Let X,, ..., X, be a basis of g and X;e, 
i=1,...,n, be the corresponding basis of T(G). We have seen that 
Ad(g)X; = )?-1 a,(g)X; and that g> (a;;(g)) defines a C” homomorphism 
of G > Gi(n, R). The linear transformation Ad(g)* on /\"(T,(G)) determined 
by Ad(g) acts as follows on Q,: 


Ad(g)*Q, = det(a,,(g))Q. . 

However, since G is compact and connected, the same applies to its 
image under the C*-homomorphism g > det(a,,(g)) of G to R*, the multipli- 
cative group of nonzero real numbers. However, the only compact con- 
nected subgroup of R* is {+1}, the trivial group consisting of the identity, 
hence det(a;,(g)) = 1 and Ad(g)*Q, = Q, for all ge G. By the preceding 
lemma this proves that Q is bi-invariant. 

Any other bi-invariant Q must be of the form 2Q, where / is a positive 
constant; but then vol G = fe AQ = 4 fe Q. Hence it is possible to choose 
Just one A Æ O such that vol G = +1. For the opposite orientation on G, we 
would have —Q as the corresponding unique bi-invariant volume element. 


From the existence of such a bi-invariant volume element one is able to 
deduce many important properties of Lie groups, of which the next two 
corollaries give examples. Further implications will appear later. 


(3.7) Corollary Ona compact connected Lie group G it is possible to define 
a bi-invariant Riemannian metric ®. 
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Proof Let ®, be a symmetric, positive definite, bilinear form on T,(G) 
and let Q be the bi-invariant volume element. Given X., Y,€T,(G), we 
define a function on G by 


f(g) = (Ad(g)*®.)(Xe, Ye) = P(Ad(g)X- , Ad(g) ¥e), 


the last equality being just the usual definition of Ad(g)*. Then define the 
bilinear form ®, on T,(G) by 


&(X..¥.) = | SOR 
*G 
According to Lemma 3.4, ®, determines a bi-invariant form if for every a € G 
Ad(a)*®,(X,, Ye) = ®(X,, Ya). 


The left-hand term may be written @,(Ad(a)X,, Ad(a)Y,). Applying the 
definition of ®, to this expression, we find that 


(Ad(a))*B(X,.%.) = | (Ad(gy*®,) (Ad(a)X, . Ad(a)¥,)0 
=| Ad(a)* Ad(g)*®,(X., ¥.)Q 


=| Ad(ga)t,(X., YJ, 
“G 
since Ad(a)* o Ad(g)* = (Ad(g) ° Ad(a))* = Ad(ga)*. This shows that 
Ad(a)*®(X,., Y.) = j f(Ra(g))2. 


On the other hand, R,: G > G is a diffeomorphism and Theorem 2.2 (iv) 
asserts that 


| f(gQ=| f(R.(g))R#2. 
* RaG) “G 
Since R,(G) = G and R*#Q = Q, we see that 
Ad(a)*®(X,, Y.) = | f(g)Q = O(X.. Ya). 
*G 
It follows that ® is a bi-invariant bilinear form on G. It is obviously sym- 
metric and it is easy to check that it is positive definite. Since we do so ina 


more general case below, we will omit this verification here. I 


(3.8) Remark When we use this Riemannian metric on G, we see that 
both right and left translations are isometries, that is, they preserve the 
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Riemannian metric (and also its associated distance function). We shall see 
later that as Riemannian manifolds compact Lie groups have rather inter- 
esting properties. 

In closing this section we shall give another application, not too different 
from the above, which in fact actually includes it. Let (p, V), p: G > GI(V), 
be a representation of G on a finite-dimensional real vector space V. As we 
have noted, if a basis is chosen in V, this determines a C” homomorphism of 
G into Gl(n, R), n = dim V, a special case is p = Ad with V = g. 


(3.9) Theorem If G is compact and connected and p is a representation of G 
on V, then there is an inner product (u, v) on V such that every p(g) leaves the 
inner product invariant : 


(p(g)u, p(g)v) = (u, v). 


Proof Let ®(u, v) be an arbitrary inner product on V and, given a fixed 
u, ve V, let f(g) = ®(p(g)u, p(g)v), thus defining a C” function on G. Then 
we define 


with Q denoting the bi-invariant volume element. The linearity of the inte- 
gral implies at once that (u, v) is bilinear, and it is clearly symmetric in u, v 
since the integrand is. Moreover (u,u) > 0, and equality implies u = 0, by 
virtue of the fact that f(g) > 0 on G with equality holding if and only if the 
integral vanishes. Finally, for ae G we have 


(Plaju, p(ayv) = | O(o(a)ola)u, a(g)o(ayv)e 


= I. ®(p(ga)u, p(ga)v)Q = I f (ga)Q. 


But by the same argument as in the previous proof, this is equal to 
fo £(g)Q = (u, v). This completes the proof. If we let p = Ad and V = g, we 
obtain Corollary 3.7 as a special case. 


We could state this result as follows: Each p(g) is an isometry of the 
vector space V with the inner product (u, v). Since the matrix of an isometry 
of V relative to an orthonormal basis is an orthogonal matrix, we have the 
following corollary concerning the representations of a compact group. 


(3.10) Corollary Relative to a suitable basis of V, the matrices representing 
every p(g) are orthogonal. 
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Theorem 3.9 is very important in the representation theory of compact 
Lie groups. We shall say that W c V is invariant if it is invariant for every 
linear transformation p(g). The representation is irreducible if V contains no 
nontrivial invariant subspaces; if each invariant subspace W has a com- 
plementary invariant subspace W’, such that V = W@ W’, then the re- 
presentation is said to be semisimple. In this case it is easily verified that 
V=W,@®--'@W,, where the W, are invariant irreducible subspaces. 
Applying Theorem 3.9 gives an important result. 


(3.11) Corollary If p is a representation of a compact connected Lie group 
G on a finite-dimensional vector space V, then it is semisimple. Moreover 
V=W,@-:'@W,, where for i + j the subspaces are mutually orthogonal 
and each is a nontrivial irreducible subspace. 


Proof If V is irreducible, there is nothing to prove. If V contains a 
nontrivial invariant subspace W, then its orthogonal complement W+ is also 
invariant: Let we W+ and let ve W. Then (p(g)v, p(g)w) = (v, w) = 0. Thus 
p(g)w is orthogonal to p(g)v for every ve W. Since p(g) is nonsingular, this 
means that p(g)w is orthogonal to every element of W and must then be in 
WŁ. Hence V = W@ W+, a direct sum of complementary invariant sub- 
spaces. Repeated application of this argument gives the final statement of the 
corollary. I 


(3.12) Example It is easy to see that there are representations of noncom- 
pact connected groups which do not have the property of complete reducibi- 
lity, hence cannot leave an inner product invariant. For a simple example 
consider p: R > GI(2, R) acting on V? defined by 


ptt) = (j i} 


Then p(t) acts on V?, the space of all (5), x, ye R, 


of) (00-05) 


The subspace (š) is invariant but has no complementary invariant subspace. 


Exercises 


1. Show that a Lie group G has a bi-invariant volume element Q ifand only 
if it is possible to choose a basis X,,..., X,, of T,(G) such that the matrix 
representation g > (x,,(g)) corresponding to Ad(g) lies in Si(n, R), that 
is, det(x;,(g)) = +1 for all ge G. Give an example of such a G which is 
not compact. 
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2. Show that if the adjoint representation of G is irreducible, that is, there is 
no nontrivial subspace V c T,(G) which is invariant under Ad(g) for all 
g, then any normal subgroup H c G has dimension 0 or n (= dim G). 

3. If V is a finite-dimensional vector space over R and GI(V) denotes the 
group of all nonsingular linear transformations of V onto V, then GI(V) 
is a group, isomorphic to Gi(n, R), n = dim V, with the isomorphism 
resulting from a choice of basis. Show that the C® structure on Gi(V) 
obtained from such an isomorphism is independent of this choice. 

4. In Exercise 3 let V = g, the Lie algebra of a Lie group. Prove that the 
subgroup Aut g, consisting of all elements of Gi(q) which are isomor- 
phisms of g, is closed in Gi(g). Assume that each a € Aut g is given by F,, 
where F: G > G is an automorphism and show that Ad(G) is a normal 
subgroup of Aut g. 

5. Show that the connected component of the identity Go in any Lie group 
G is an open and closed set and a normal (Lie) subgroup. Show also that 
if Go has a bi-invariant volume element or Riemannian metric, then the 
same is true of G if G is compact. 


In the following three exercises let G be a connected Lie group and H a 
closed (Lie) subgroup of G and use the notation and ideas of Section IV.9. In 
particular, we denote by n: G > G/H the natural projection and by 4: G x 
G/H — G/H the natural, transitive, left action of G on G/H. 


6. Let o denote the coset H as a point of G/H and let 4’ denote the action of 
H on G/H, obtained by restriction of 2 to H. Show that for each he H, 
Àn: G/H > G/H leaves o fixed and that the correspondence h — 1), is a 
representation of H on T)(G/H). 

7. Using z4: T(G) > To(G/H), show that To(G/H) is naturally isomorphic 
to g/h and that if these spaces are identified by this isomorphism, then 
the adjoint representation of G on g, when restricted to H, induces on 
g/b the same representation as the one above. 

8. Show that M = G/H has a G-invariant Riemannian metric if H is 
compact. 


4 Manifolds with Boundary 


The problems we wish to consider when we deal with integration make it 
useful to introduce the notion of manifold with boundary, which we shall 
define presently. Examples are a line segment or ray, a circular disk or 
half-plane, a closed n-ball, a surface with an open disk removed, and so on. 
Manifolds with boundary are important for other reasons too, for example, 
to study differentiable deformations of differentiable maps from a manifold 
M toa manifold N, we will need to define C” mappings from M x I into N. 
However, M x I is a manifold with boundary, for example, if M = S', then 
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M x I is a cylinder. What we must do, then, is extend our notions of differ- 
entiable functions and mappings, of tangent space and tensor field, and so 
on, to these slightly more general objects. In the definition of manifold with 
boundary the half-planes H” play a role analogous to that of R” for ordinary 
manifolds. 

Let H” = {x = (x!,..., x") R" | x" > 0} with the relative topology of R", 
and denote by GH" the subspace defined by 0H" = {xe H" | x" = 0}. Then 
ðH" is the same space whether considered as a subspace of R” or H”; it is 
called the boundary of H". Of course all of these spaces carry the metric 
topology derived from the metric of R”, and ôH” is obviously homeomorphic 
to R"! by the map (x!,..., x"7 1) > (xt, ..., x" 7, 0). 

Remembering now that differentiability has been defined for functions 
and mappings to R” of arbitrary subsets of R”, we see that the notion of 
diffeomorphism applies at once to (relatively) open subsets U, V of H”; 
namely, U, V are diffeomorphic if there exists a one-to-one map F: U > V 
(onto) such that F and F~' are both C” maps. Although this sounds 
precisely like the earlier definition, it is broader since U, V are not neces- 
sarily open subsets of R”, but are in fact the intersections of such sets with H”. 
If U, V c R" — ôH", then U and V are actually open in R” so that this 
definition of diffeomorphism coincides with our previous one. On the other 
hand, if Un dH" + Ø, then we claim that V œ ôH" + Ø and that 
F(U n ôH") c V a ôH". Similarly, F~'(V n dH") = Un ôH”; in other 
words, diffeomorphisms on open sets of H” take boundary points to bound- 
ary points and interior points to interior points. This follows at once from 
the inverse function theorem: U — ĉH” is open in R" and hence F must map 
it diffeomorphically onto an open subset of R”, but no open subset of H” 
which contains a boundary point, that is, a point of ðH”, can be open in R". 
Thus F(U — ôH") c V — ôH” and F~'(V — ĉH") = U — ôH". Since F and 
F`! are one-to-one on U and V, the result follows. 

We also notice the following two facts: First U ^ 0H" and V ^ 6H" are 
open subsets of 0H", a submanifold of R” diffeomorphic to R" !; and F, F~! 
restricted to these open sets in 0H” are diffeomorphisms. Second both F and 
F`! can be extended to open sets U’, V’ of R” having the property that 
U = U'n H"and V = V’ n H". These extensions will not be unique nor are 
the extensions in general inverscs throughout these larger domains. 
However, the derivatives of F and F~' on U and V are independent of the 
extensions chosen and we may suppose that even on the extended domains 
the Jacobians are of rank n. These statements are immediate consequences 
of the definition of differentiability for arbitrary subsets of R” and the fact 
that the Jacobian ofa C” mapping has its maximum rank on an open subset 
of its domain. Some further amplification of this situation is given in the 
problems. The reader will find a complete discussion in Guillemin and 
Pollack [1]. 
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(4.1) Definition A C” manifold with boundary is a Hausdorff space M with 
a countable basis of open sets and a differentiable structure Y in the follow- 
ing (generalized) sense (compare Definition III.1.2): Y = {U,, @,} consists 
of a family of open subsets U, of M each with a homeomorphism g, onto an 
open subset of H” (topologized as a subspace of R”) such that: 


(1) the U, cover M; 

(2) if U,, p, and Us, pẹ are elements of X, then pg ° p; ` and g,° pg ' 
are diffeomorphisms of g,(U ^ V) and g,(U ^ V), open subsets of H”; 

(3) X is maximal with respect to properties (1) and (2). 


Examples are shown in Fig. VI.5. 


A 
(c) 
Figure V1.5 


The U, @ are coordinate neighborhoods on M. From the remarks above 
we see that if ọ(p)e 0H” in one coordinate system, then this holds for all 
coordinate systems. The collection of such points is called the boundary of 
M, denoted 6M, and M — ôM is a manifold (in the ordinary sense), which 
we denote by Int M. If 6M = Ø, then M is a manifold of the familiar type; 
we call it a manifold without boundary when it is necessary to make the 
distinction. The following theorem follows from the first of the two facts 
remarked upon above. 


(4.2) Theorem IfM is a C” manifold (of dimension n) with boundary, then 
the differentiable structure of M determines a C®-differentiable structure of 
dimension n — 1 on the subspace 0M of M. The inclusion i: 6M —> M is an 
imbedding. 
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The differentiable structure Ẹ% on @M is determined by the coordinate 
neighborhoods U, @, where U = U ^ ôM, @ = @ |v nem for any coordinate 
neighborhood U, ọ of the atlas Y of M which contains points of 0M. 

Differentiable functions, differentiable mappings, rank, and so on, may 
now be defined on M exactly as before by using local coordinates. By virtue 
of the C” compatibility of such coordinate systems these concepts are 
independent of the choice of coordinates. We leave the verification to the 
reader. We also define T,(M) at boundary points of M. This could be done 
using derivations on C*(p) as before, but to avoid some slight complications 
we use an alternative definition. First note that in the case of H” c R”, upon 
which manifolds with boundary are modeled, we identify T,(H") with T,(R"); 
we may think of this identification as being given by the inclusion mapping. 
For xe ¢H", this defines what we mean by T,(H"). In the case of a general 
manifold M, for p€ 0M we define a vector X „€ T,(M) to be an assignment to 
each coordinate neighborhood U, ọ of an n-tuple of numbers (at, ..., x"), 
the U, œ components of X, satisfying the following condition: If (x?,..., x”) 
and (y!,..., y”) are coordinates around p in neighborhoods U, p and V, W, 
then the components («!,..., a") and (B',..., 6”) relative to U and V are 
related by 


: "(ay r 
s= $ (35 a, i= 1,...,n 
j=l dx’ o(p) 


(as in Corollary IV.1.8). What this does is attach to each pe M a T,(M) such 
that each coordinate system U, p determines an isomorphism p, taking X, 
with components (a!,...,a") to the vector } a‘(@/éx')e Tpp(H") As 
previously E,,..., E, will denote the basis determined by P(E) = €/ex', 
i= 1,...,n. Having defined T,(M) on ôM [it is already known on Int M, 
which is an ordinary manifold], we may extend all of our definitions and 
theorems to manifolds with boundary. In particular, exterior differential 
forms and the exterior calculus is still valid on manifolds with boundary. 
There is no essential change in the definitions or proofs. 

For many purposes, in particular for our discussion of Stokes’s theorem 
in the next section, we could use an (apparently) weaker, but closely related 
notion. 


(4.3) Definition A regular domain D on a manifold M is a closed subset of 
M with nonempty interior D such that if pe ôD = D — D, then p has a 
cubical coordinate neighborhood U, ọ with g(p) = (0, ..., 0), p(U) = Cz(0), 
and @(U ^ D) = {x €C%0)|x" > 0}; thus x" = 0 on @D. 


It follows that @D is a regular submanifold. We remark that if D is 


compact, then it is a domain of integration on M. It is a straightforward 
matter to check that D, with the topology and differentiable structure induced 
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by M is a manifold with boundary. All of our examples can be seen to be of 
this type: H” and the closed unit ball B" are regular domains of M = R", 
N x I is a regular domain of N x R, and the set D obtained by removing 
from a manifold M a diffeomorphic image of an open ball is a regular 
domain. Further examples are given in the exercises. 

It is a fact—somewhat too difficult to prove here—that any manifold M 
with boundary can be realized as a regular domain of a larger manifold M’. 
The basic idea is simple enough: one simply takes two copies of M, say M, 
and M,, and “glues” them together along their boundaries, identifying 
corresponding boundary points. The resulting manifold, called the double of 
M, contains M as a regular domain. Figure V1.6 shows the doubles of the 
examples of Fig. VI.S (which are shaded in Fig. V1.6). For details the reader 


Figure V1.6 


should consult Munkres [1, Section 6]. For regular domains it is simpler or 
at least more intuitive to define the tangent space at boundary points and to 
define the calculus of exterior differential forms, since we may do so by 
restriction of the corresponding objects on M. This could be taken as further 
evidence that the reader who wishes to carefully check the details of the 
extension to manifolds with boundary of the concepts and Operations we 
have used for manifolds without boundary, will encounter no serious obsta- 
cle. In fact, by definition M is locally diffeomorphic to H", which is a regular 
domain of R”. 

We consider next the question of orientability. A manifold M with non- 
empty boundary is orientable provided that it has a covering of coordinate 
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neighborhoods {U,,@,} which are coherently oriented, that is, 
U, ^ U, # Ø implies gy ° g; ` has positive Jacobian determinant (or equiv- 
alently, preserves the natural orientation of H"). This is equivalent to the 
existence of a nowhere vanishing n-form Q on M. The proof is the same 
except that when we speak of a partition of unity on M associated to a 
regular covering {U;, V;,@,} we limit ourselves to a regular covering by 
cubical coordinate neighborhoods concerning which we impose the follow- 
ing slight restriction: if U; ^ ôM + Ø, then g{U,) = C3(0)^ H” and 
o(V,) = Ci(0) ^ H”. With this modified definition of regular covering we 
still have a regular covering (by definition locally finite) refining any open 
covering {4,} of M and an associated C% partition of unity {f} on M. We 
remark that those U;, V;, p; of the regular covering that intersect 0M deter- 
mine a regular covering U; = U; 0M, V,=V,;06M, and Q; = gilo; of 
ĉM and the associated partition of unity restricts to an associated partition 
of unity {J; = filem} on CM. 


(4.4) Theorem Let M be an oriented manifold and suppose CM is not empty. 
Then ĉM is orientable and the orientation of M determines an orientation of 
ðM. In particular, this holds for ðD if D is a regular domain on M. 


Proof Since @M is an (n — 1)-dimensional submanifold of M, its tan- 
gent space at each point may be identified with an (n — 1)-dimensional 
subspace of T,(M); we denote this subspace by T,(@M). We shall show that 
there is a distinction between the two half-spaces into which T,(¢M) divides 


T,(M) which is independent of coordinates. Suppose that U, ọ and V, y are 
coordinate neighborhoods of pe ôM with local coordinates (x’,..., x") and 
(yt, ..., y"), respectively. By our definitions of coordinates of boundary 
points, the last coordinate x", or y” is equal to zero if the point in U or V, 
respectively, is on OM, and positive otherwise. Writing y' = y'(x', ..., x"), 
i=1, ..., n, for the change of coordinate functions, we have 
0 = y"(x!,...,x774,0) so that (@y"/Ox") oq = = (Ay"/Ox"~ "gig = 0 for 
every qe U n ôM. It follows that the Jacobian matrix then has the form 
ô 1 ô n-i 
D icy, E 
Ox Ox 
ey} dy" i 
Dib o@)= Geert) Siete a 0 
(y p ) ôx" 1 6x" 1 
ôy! 2 ôy 
ox" ôx" ax" | ° 


Since the Jacobian is nonsingular, éy"/éx” + 0 at ~(q); in fact, it must be 
positive. For let (q) = (a’, a*,...,a"~', 0) and consider f(t), defined by 
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f(t) = y"(a',..., a71, t). Then f’(0) = (@y"/Ex")4 can certainly not be neg- 
ative since f(0) = 0 and f(t) > 0 in some interval O <1 < ô; therefore 
ðy"/ðx" > 0 at w(q) as claimed. 

If U, g and V, w are oriented neighborhoods of M, then Di m7") has 
positive determinant so both ¢@y"/éx” and the (mn — 1) x (n — 1) minor de- 
terminant obtained by striking out the last row and column have the same 
sign. This minor determinant is exactly the determinant of D(ọ o @~*), the 
change of coordinates from U = UN ôM, 6 = olg to V = V n ôM, Y = wip 
on the submanifold M. Thus the neighborhoods on ðM determined by 
oriented neighborhoods on M are coherent and determine an orientation on 
ôM. 


(4.5) Remark Using the notation of the proof, let qe U ^ V be a bound- 
ary point of M and let X, € T,(M). Because (ĉy"/2x")pa > 0, it follows that 
when we express X, in the coordinate frames of either U, ọ or V, y, 


X, = aE, + eo F a" Ep1 + a"E, = B'F, ee + aa Fpa- + BF,» 


then a” and f" have the same sign. (This fact does not depend on the coordin- 
ates being oriented.) It follows that the vectors of T,(M) — T,(@M) fall into 
two classes, those whose last component is positive—which we call inward 
pointing vectors at pe ĝM—and those for which the last component is 
negative—which we call outward pointing vectors (see Fig. VI.7). Those for 
which the last component vanishes are tangent to 0M. This classification is 
independent of the orientation of M. The orientation above on ðM could be 
described as follows: If pe ôM and X, is an inward pointing vector, then a 
basis E,,,.-., Eq,—1yp Of T,(0M) is oriented if and only if E,,,..., En- 1p Xp 
is an oriented basis of T,(M). Whenever M is oriented, 0M will denote this 
orientation on the boundary. For Stokes’s Theorem (below) it is convenient 


Figure V1.7 
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to use the opposite orientation —0M when n is odd. We will denote this 
orientation on the boundary by 0M. Thus êM = (—1)" 6M 


(4.6) Remark We have noted that there are difficulties in gluing two 
manifolds with identical boundaries together along their boundaries. We 
can, however, describe a special case which will give some idea of the impor- 
tance of such operations. Let M,, M, be two manifolds (without boundary) 
of dimension n and let U;, p; be coordinate neighborhoods of points p;e M,, 
i = 1,2. We suppose that o,(p,) = (0,..., 0) and that @(U;) = B}(0) in each 
case and we set V; = g; '(B,(0)). Then M; = M; — V,,i = 1, 2, is a manifold 
with boundary, indeed ọ;(2M;) = S"”'. The manifold obtained by gluing 
M\ to M’ along the boundaries is called the connected sum of M, and M3, 
denoted M, # M, (see Fig. VI.8). In order to define it without loss of 
differentiability, we will actually remove only ø” '(B,,2(0)) from each M, to 
obtain M”, leaving in each U, a “collar,” C; = @; ' (B3)2(0) — B,)2(0)) 
parametrized by ; '{(x,t) = (x1,..., x, O||Ixll = 1, 3<¢< 3}, a diffeo- 
morph of S"~! x (4, 3). We identify points of C, corresponding to (x, t) with 
those of C, corresponding to (x, 2 — t). 

Any closed surface (compact 2-manifold) can be obtained as the con- 
nected sum of copies of S? and T? if orientable, P?(R) and T? if nonorientable 
(see Wallace [1], Massey [1], and Hirsch [1]). 

This whole procedure and ones similar to it have become very important 
in the recent years and are intimately related to the attempt to classify or list 
all simply connected n-dimensional compact C” manifolds—a problem 
which was solved long ago for closed surfaces but is still unsettled in dimen- 
sion three. Oddly enough there has been more success in higher dimensions! 
It is not yet known whether there exist simply connected, compact, orient- 
able manifolds of dimension three other than S?; that there are none is the 
famous Poincaré conjecture. Similar questions in dimension > 5 were 


1 M 


Figure V1.8 
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answered by Smale [1]. Milnor [3] has shown that every compact 3-manifold 
can be represented uniquely as a connected sum of 3-manifolds which 
cannot be further decomposed into connected sums. 


Exercises 


1. Let M be a manifold without boundary and f: M > R a C~” function. 
Suppose that df# 0 onf 1(0) and show that M* = {pe M| f(p) > 0} 
is a regular domain of M. What is its boundary? 

2. Let C be an imbedded image of S! in R*. Show that there is an e > 0 
such that N = (J, ec B,(x) is a manifold with boundary. Show that it is 
diffeomorphic to the solid torus. What can be said about the comple- 
ment in R? of Ñ (the interior of N)? 

3. Show that if M has a Riemannian metric, then there is a uniquely 
determined vector field X defined at each point of @M such that X, is 
inward pointing, is orthogonal to T,(¢CM) for each pe CM, and has unit 
length. 

4. Show that if M is a manifold with boundary, then it is always possible to 
choose a vector field X defined at each point of CM such that X is 
inward pointing. Given such X and an Q on M which is an n-form 
determining the orientation, then show that the form œw = (— 1)" 1i(X)Q 
determines the orientation of ôM and i(X)Q that of ÕM. [i(X)Q is defined 
as in the Exercise of Section V.8.4]. 

5. Let M be a connected, closed, regular 2-manifold in R? such that R? — M 
is not connected. Show that it is possible to define a continuous field of 
unit normal vectors to M and that R? — M has two components, each 
with M as boundary. 

6. Let U, V be open subsets of H” and F:U>V, G: V —> U diffeomor- 
phisms which are inverse to each other. Discuss the possibility of finding 
extensions F’, G’ to open subsets U’, V’ of R” containing U, V, respec- 
tively, such that G’> F’ = id,, and F’o G’ = idp.. 


5 Stokes’s Theorem for Manifolds with Boundary 


We consider an oriented manifold M with possibly nonempty boundary 
€M, oriented by the orientation of M. We shall consider only oriented coor- 
dinate neighborhoods U, ø in what follows. If U œn êM + Ø, then we denote 
by U, ® the corresponding neighborhood U = Un 6M, = |e on ôM. 
All of the concepts used in defining the integral extend to M; namely the 
definitions of content zero, domain of integration, and so on. In particular 
OM has measure zero and, if compact, has content zero. This follows from 
corresponding properties of 0H" (and Corollary 1.14). A cube Q is associated 
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with U, ọ as in Section 2 unless U n 0M # Ø, in which case we assume that 
Q has a “face” on ðM, that is, 


o(Q n ôM) = {xeR"|0<x'<1 and x" =O}. 


In this case we note two facts: (a) = Q ^ ôM is a cube of ôM associated 
with U, @ and (b) Ò = g '({xeER"|0 < xi < 1,1 <i<n—1;0<x"<1}), 
that is, the interior of Q has a different image in R" than it has when 
U c Int M. 

Taking these minor modifications into account, the definition of fu Q is 
exactly as in Section 2 and the integral of an integrable n-form has the same 
properties as before. Indeed, if M is a compact regular domain in a manifold 
N, then it is necessarily a domain of integration in N and fu Q= fn ku Q so 
there is nothing new to define in this case! The same comments apply to the 
integral over a Riemannian manifold with boundary and to the definition of 
vol M when M is compact. 

Now suppose M is both oriented and compact and that œ is an (n — 1) 
form of class C! at least on M. We have an important relation between the 
integral of dw over M and i*w, the restriction of w to ôM (i: 6M > M is the 
inclusion mapping). To simplify the statement of the theorem we let 6M 
denote ôM, the boundary with the orientation induced by M, when n is even 
and — ôM, the boundary with the opposite orientation when n is odd; thus 
OM = (— 1)" ôM (cf. Remark 4.5). 


(5.1) Theorem (Stokes’s theorem) Let M be an oriented compact mani- 
fold of dimension n or a compact regular domain of an oriented manifold and let 
ôM have the orientation ÕM described above. Then we have 


| do =| ito. 


M “eM 


When 0M = Ø, the integral over M vanishes. 


Proof According to our definitions it is enough to establish the theorem 
for an w whose support is contained in the interior O ofa cube Q associated 
to a coordinate neighborhood U, p. Suppose w has its support in Q and 
x!,..., x" are the local coordinates. We may suppose that in these coordin- 
ates œw is expressed as 


po *(w) = $ (-1P A dx aa dx! a dxdt) ace a dx". 
Then we have 


n i 
oy '*(dw) = do~ '*(w) = ( 2 a) dx! a+ adx" 
jan OX! 
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so that 


| w=] (3 5a) -5j E dx? + dx" 


This follows from the definition of integration on M and the iterated integral 
theorem. The expression on the right may be rewritten; consider the jth 
summand only and integrate first with respect to the variable x’. This gives 
an (n — 1)-fold iterated integral 

KJ 1 * 1 k E 2 
(+) | | ei es a Ne ce ered 


=g “0 


, ME a ~ 
A en VG ceed X”) dx! oe os dx”, 


where we indicate by dx} that this differential is to be omitted. The sum of 
these (n — 1)-fold iterated integrals for j= 1,...,n gives fy dw if 
supp(@ w) < Q. Two cases can occur: either Q ^ @M.= Ø, in which case 
e(Q) = {x]O<x' <1, i=1,...,m; or @NOM+#@, in which case 
p(Q) = {x]0< xi <1,i=1,.. ed 0 < x" < 1}. In the first case, using 
supp œw c Q, we see that 4 = 0 if any x/ = 0, 1. Hence each of the inte- 
grands in («) vanish and Jy dw = 0. On the other hand w restricted to ôM is 
the zero (n — 1)-form since supp œ ¢ Q which has no points on 0M. Thus 
fu dœ = 0 = fay i*w and Stokes’s theorem holds for this case. 
In the second case we again have all of the integrands in (+) equal to zero 
except the one corresponding to j = n; therefore 
1 vl 

| do = -| a A(x... x" 80) dx! dx}, 

“M “0 
On the other hand we may evaluate fam i*w using the fact that i*œ has its 
support in @ = Q n ôM so that its expression in the local coordinates U, 6 
(obtained by restriction of U, @) collapses to 


Oo t*(i*w) = (1) A, gx 4, Ode A adx"! 


[We may obtain this from the expression for w by applying the correspond- 


ing inclusion i: (x',..., x" ')— (xt, ..., x"71,0) in the local coordinates 


and noting that i* dx" = 0.] This will give 
1 1 
[ i*œ =(-1)"! f oe | POO ge” 2, 0) dx! + dx" t., 
“eM “0 “0 
Thus, in the case where supp œ c @ and Ôn ôM + Ø, we find that 
[ dw = (— 1)" f i*w = | i*w, 
“M “eM +éM 


with the right-hand integral over 0M when n is even and —@M when n is 
odd, that is, over 0M. I 
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When n = 1, M = [a,b] = R and Stokes’s theorem is the fundamental 
theorem of calculus. Green’s theorem and Gauss’s theorem are also special 
cases, giving some idea of its great importance. 


(5.2) Example (Greens theorem) Let M be a bounded regular domain 
of R?, that is, the closure of a bounded open subset of the plane bounded by 
simple closed curves of class C®; for example, let M be a circular disk or 
annulus. Then ðM is the union of these curves (in the cases mentioned a 
circle or a pair of concentric circles). If œ is a one-form of class C’ on M, 
then, using the natural Cartesian coordinates, we have œw = a dx + b dy. We 
may suppose, by definition of differentiability on arbitrary sets, that a, b are 
restrictions of C! functions on some open set containing M. We have 
dw = ((@b/@x) — (@a/éy)) dx ^ dy and by Stokes’s theorem 


- b 
i fas >| dx Ady = 5 adx + bdy. 
According to Remark 2.3, the left-hand side is the ordinary Riemann integral 
over the domain of integration M c R?. On the other hand, if we think of 
M as a one-dimensional manifold and cover it with (oriented) neighbor- 
hoods, it is clear that its value is that of the usual line integral along a curve 
C (or curves C;) oriented so that as we traverse the curve the region is on the 
left. (This is further discussed below.) Thus the equality above may be 


written so 
a 
We-a)* xdy = Z | adx + bay, 


i "C 
which is the usual statement of Green’s theorem. 


(5.3) Example Let M be a regular domain of R?, that is, the closure of a 
bounded open set bounded by closed C® surfaces. Examples are the ball of 
radius 1, which is bounded by the sphere S’, or the region interior to a torus 
T2, obtained by rotating a circle around a line exterior to it. Consider the 
two-form œ = P dyndz + Q dzadx + Rdx Ady, where P, Q, R are C' 
functions on some open set of R? ty M. We have 


cP uae 
dw = dx A dy a dz 
i (= 7 ay * ez =) KAENA 


and Stokes’s theorem asserts that 


[ OF 02 4 GR dx dyn dz = | P dyadz + Q dzadx + R dx^ dy. 
m\Ox y @z -ôM 


If we use Remark 2.3 and our definitions to translate this into a Riemann 
integral over a domain and a surface integral over the boundary, respec- 
tively, then we obtain the usual divergence theorem of advanced calculus. 
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(5.4) Example A third example is obtained if we consider M to bea piece 
of surface imbedded in R? and bounded by smooth simple closed curves, for 
example, a sphere with one or more open circular disks removed, thus 
leaving boundary circles, which are 0M. Since dx, dy, and dz may be con- 
sidered, by restriction, as one-forms on M or on 6M, any one-form œw on M 
may be written: œ = A dx + B dy + C dz, where A, B, and C are Ct func- 
tions on M. In this case Stokes’s theorem asserts that 


eC ôB ðA OC CB CA 
<<- z+ {[—-— — — =] dxady 
| (S a ee ( = es (a | rene 


The integral on the left-hand side may be converted by the procedure used in 
defining integrals of forms on manifolds to an ordinary surface integral over 
the surface M in R? and that on the right to a line integral. When this is done 
one obtains the usual Stokes theorem of advanced calculus. 


*M 


A dx + B dy + C dz. 
M 


Often these examples of special cases of the general Stokes theorem are 
stated in terms of vector calculus and vector operations such as gradient, 
divergence, and curl. To establish the equivalence would require use of the 
duality between vectors and covectors and other such relations on R” which 
use the fact that R” is a Riemannian manifold. Basically these stem from the 
natural isomorphism of a Euclidean vector space and its dual (and its exten- 
sion to a duality between covariant and contravariant tensors). We do not 
need these operations in what follows so they will not be taken up here; 
some indications are given in the exercises. 

It is important to note that the version of Stokes’s theorem proved above 
is deficient in the following sense: it holds only for smooth manifolds with 
smooth boundary. Thus, for example, our proof does not even include the 
case of a square in R? or an open set of R? bounded by a polyhedron. The 
difficulty in these cases is not so much with the analysis and integration 
theory, as with describing the regions of integration to be admitted and with 
giving precise definitions of orientability and induced orientation of 
the boundary. The search for reasonable domains of integration to validate 
Stokes’s theorem usually leads to the concept of a simplicial or polyhedral 
complex, that is, a space made up by fastening together along their faces a 
number of simplices (line segments, triangles, tetrahedra, and their generali- 
zations) (Fig. VI.4) or more general polyhedra (cubes, for example). Since it 
can be shown (see Munkres [1]) that any C” manifold M may be “ trian- 
gulated,” which means that it is homeomorphic (even with considerable 
smoothness) to such a complex, the integral over M becomes the sum of the 
integrals over the pieces, which are images of simplices, cubes, or other 
polyhedra as the case may be (compare Remark 2.7). The strategy is then to 
reduce the theory (including Stokes’s theorem) to the case of polyhedral 
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domains of R”. This approach is particularly important for those interested 
in algebraic topology and de Rham’s theorem. It is very clearly set forth, for 
example, by Singer and Thorpe [1] or Warner [1]. 

For many purposes, integration of forms over C+! (but not one-to-one or 
diffeomorphic) images of simplices and polyhedra in a manifold is very 
useful. Some idea of how this works is given in the following example. 


(5.5) Example (Line integrals in a manifold) Let [a, b]=rteR| 
a < t < b}and let F: [a, b] > M bea C! mapping whose image is, then, a C! 
curve S on M. If œ is a one-form on M, we define fs w by 


[o= | F*o. 


This is called the line integral of w along S. In general S is not a submanifold 
of M; it can be very complicated. However, the right-hand side is the integral 
of a one-form, F*w = f(t)dt, on a one-dimensional manifold with bound- 
ary; thus P b 
[o= | fO d. 
“Ss “a 
Exactly as for line integrals in R”, we may prove that the value of the integral 
does not depend on the parameter as long as the orientation of S is preserved 
(Exercise 5). Thus the integral of œw over an oriented C' curve S of M is 
defined. When we reverse the orientation, traversing S in the opposite sense, 


it changes the sign of the integral. We write {_5@ = -fs œ. 
More generally, let S be an oriented continuous and piecewise differen- 
tiable curve, that is, we consider S to be a union of curves S,, S,,..., S, such 


that each S; is C! and the terminal point of S; is the initial point of S;,, 
(terminal and initial point make sense since we are dealing with oriented 
curves). Then we define the integral over S by 


r 
|o= ` | W, 
“Ss i=l “S; 


thus extending to this case the definition of line integral on a manifold. This 
definition reduces to the usual one when M = R". In fact we could have used 
that as a starting point by subdividing the curve Š on an arbitrary manifold 
into a finite union of C! curves S,, each in a single coordinate neighborhood 
and evaluated the integral over each S; in local coordinates, that is in R”. 


(5.6) Example Consider the special case w = df, where fis a C™ function 
on M. (This implies that dw = 0.) In this case the value of the line integral 
along the piecewise differentiable curve S from p to q is given by 


| f=) - Fo). 


In particular, it is independent of the path chosen. The verification is 
Exercise 1 at the end of the section. Note that if p is held fixed, then f (q) is 
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given at each q by adding f (p) to the value of the line integral along any 
piecewise C' curve from p to q. Thus f is determined to within an additive 
constant by the line integral (assuming M connected). 

The fact that a (line) integral of a one-form œ over an oriented piecewise 
differentiable curve § has been defined enables us to state Stokes’s theorem 
for a polygonal region Q of R? bounded, as it is, by an oriented piecewise 
linear (simple closed) curve § = @Q. We carry this out for the unit square. 


(5.7) Theorem Let w be a C° one-form defined on Q = {(x, y): 0< x <1, 
0 < y< 1} and let S be the boundary of Q traversed in the counterclockwise 
sense. Then fo dw = fs a. 


Proof Let w = adx + b dy, where a,b vanish outside Q and are C! 
functions on Q. Then dw = ((@b/éx) — (@a/éy)) dx Ady on Q and by 
Remark 2.3, 

. '.1é@b ĉa 
dw = —-—-.~]dxd 
l, j l l, (a 5) os 


1 


= [HC y) = 600, yJ) dy = | Tal 1) = alx, 0) dx 


The orientation is that given by the standard coordinate system in R?. On 
the other hand the integral over the boundary is 


| 


4 KJ 
w=} | adx + bdy 
5 i=1°S; 
0 


1 1 0 
=Í a(x,0)dx + [ b(1, y) dy + | a(x, 1) dx + | b(0, y) dy. 
“0 “0 1 -1 
This is because dy = 0 on the vertical sides and dx = 0 on the horizontal 
ones. (See Fig. VI.9.) Comparing the values of the integrals shows that the 
theorem is true. | 


ba 


Figure VI.9 
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We remark that this essentially mimics our earlier proof, and from the 
procedure used it is clear that we could state and prove this theorem for 
rectangles of any dimension and even for triangles, tetrahedra, and other 
simplices although in these cases, as we mentioned, some machinery is neces- 
sary to describe proper orientations on the faces. 


Exercises 


1. Prove the formula of Example 5.6 for fs df by first proving it for curves 
of class C! which lie in a coordinate neighborhood. 

2. Evaluate the line integral of œ = x?y dx + x dy on M = R? along the 
radial path from (0,0) to (1,1) and along the path consisting of the 
segments (0,0) to (1,0) and (1,0) to (1, 1). Determine whether this 
integral is independent of the path. 

3. Show that all line integrals of œ = Pdx + Q dy + R dz in R? are 
independent of the path only if the value of the integral over any closed 
(piecewise C!) path is zero. Use this and Stokes’s theorem to obtain a 
condition on P, Q, R which is sufficient to show independence of the 
path. [Assume w is defined on all of R° 

4. In Example 5.3 let M be the unit ball and 0M the unit sphere. If P = x’, 
Q = y?, and R = z?, compute both sides of the equation giving Stokes’s 
theorem. 

5. Suppose that S is an (oriented) C' curve on a manifold M given paramet- 
rically by a mapping F: t> F(t), a < t < b of [a, b] into M. Suppose - 
t = f(s), c < s < d is a change of parameter on S. Show that the valuc of 
the line integral over S of any one-form w is unchanged if f(s) > 0, that 
is, if the orientation of G = F o fis the same. 

6. Prove Stokes’s theorem for a triangle in R? and a cube in R°. 


6 Homotopy of Mappings. The Fundamental Group 


One of the most basic ideas used in the study of mappings from one 
space to another is that of homotopy. Two mappings are said to be homo- 
topic if one can be “deformed” into the other through a one-parameter 
family of mappings between the same spaces. Sometimes further conditions 
are imposed on the family of mappings as we shall see. The basic definition 
can be stated as follows. 


(6.1) Definition Let F,G be continuous mappings from a topological 
space X to a topological space Y and let J = [0, 1], the unit interval. Then F 
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is homotopic to G if there is a continuous mapping (the homotopy) 
H:X xI7-Y 


which satisfies the conditions: F(x) = H(x,0) and G(x) = H(x, 1) for all 
xe X.If X and Y are manifolds and F, G are C”, we define a C” or smooth 
homotopy by requiring that H be C® in addition to the conditions above. 


We remark that H,(x) = H(x, t) does indeed define a one-parameter 
family of mappings H,: X > Y,0 < t < 1, with F = Hy and G = H,. The 
formulation of the definition emphasizes the simultaneous continuity in 
both variables t and x. 

Some brief comments on the C® case: If 0X = Ø, then X x lisa 
regular domain of X x R and is a manifold with boundary. Indeed, 
O(X x I) = X x {0}U X x {1}, so C” is perfectly well defined. If 0X # Ø, 
then X x I isnot a manifold with boundary [consider X = B7(0), the closed 
unit disk, for example]. However, it is a reasonably nice domain of X x R 
which is a manifold (with nonempty boundary), so only minor technical 
problems arise. We remark however, that when both X and Y have non- 
empty boundaries, there are cases in which it is natural to require that 
H{0X) < ðY for 0 < t < 1, which is closely related to the generalization 
below. 

When the class of continuous maps from a space X to a space Y is 
considered in its entirety, then homotopy of maps forms an equivalence 
relation, and for many purposes it is the equivalence class of the map that is 
important and not the particular representative. We shall illustrate this in 
great detail in a special case, namely X = 1, the unit interval. Before doing 
this we mention a useful generalization of our definition: Suppose (X, A) 
and (Y, B) are pairs consisting of spaces X and Y and closed subspaces 
Ac xX and Bc Y. Consider F,G: X — Y continuous maps such that 
F(A) = Band G(A) = B; F and G map the pair (X, A) into the pair (Y, B) 
continuously. We say that F and G are relatively homotopic if there exists a 
continuous map H: X x I > Y such that H(A x I) € B, H(x, 0) = F(x), 
and A(x, 1) = G(x). We have added to Definition 6.1 the requirement that 
H,(A) < B for0 < t< 1. When A = Ø = B, the definition reduces to the 
original one. We will write F ~ G to indicate that F and G are (relatively) 
homotopic; we justify this notation as follows. 


(6.2) Theorem Relative homotopy is an equivalence relation on the contin- 
uous maps of (X, A) into (Y, B) for any topological spaces X and Y and closed 
subspaces A and B, respectively. 


Proof The relation is reflexive since H(x, t) = F(x) is a homotopy of 
F(x) with F(x). It is symmetric as well; given a homotopy H(x, t) of F to G, 
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then A(x, t) = H(x, 1 — t) is a homotopy of G to F. Finally, suppose 
F, ~ F, and F, ~ F, by homotopies H, and H,, respectively. Then we 
define H(x, t) a homotopy of F, and F, by 

|H (x, 2t), 0 < 
|Ha(x, 2t- 1) 4<t< 
It is easily verified that H(x, t) is continuous, and that all these maps take A 
into B for every t between 0 and 1 inclusive. We leave the C” case to the 
exercises. | 


A(x, t) = 


Homotopy of Paths and Loops. The Fundamental Group 


As a first application of the concept of homotopy we will consider homo- 
topy classes of continuous maps of the unit interval J = [0, 1] into a mani- 
fold M. A map f: I > M of this type is called a path, f (0) its initial point, and 
f (1) its terminal point. We shall consider homotopy classes of paths under 
the additional restriction that the homotopy keep initial and terminal points 
fixed, that is, H(t,0) and H(t, 1) are constant functions.’ This is exactly rela- 
tive homotopy for (J, {0, 1}) and (X, {b, d}), b = f (0), d = f(1). Given a mani- 
fold M, fix a basepoint b on M and consider the paths with b as initial point. 
If b is also the terminal point, then the path is called a loop; thus a loop is a 
continuous map f: I > M such that f(0) = b = f(1). We denote its homo- 
topy class by [f], meaning always relative homotopy. Among these classes is 
that of the constant loop e,(s) = b, O < s < 1. If this is the only homotopy 
class and M is connected, then we say M is simply connected; this means that 
every loop at b can be deformed over M to the constant loop. It is rather 
easy to see (Exercise 1) that this property does not depend on the choice of b 
and is equivalent to the statement that any closed curve (continuous image 
of S') may be continuously deformed to a point on M. 

Paths, loops, and their homotopy classes are very useful in the study of 
spaces from the point of view of algebraic topology, for an important objec- 
tive is to assign algebraic objects, such as groups, to spaces in such a way 
that they depend only on the topology of the space, that is, are invariant 
under homeomorphism, and thus “measure” topological features. We shall 
illustrate this process in this chapter for the case where our spaces are 
manifolds. The restriction to manifolds is not essential but is for convenience 
only. 

If M is a connected manifold and f, g are paths on M with the terminal 
point f(1) coinciding with the initial point g(0), we may clearly combine 
these to a single path h after readjusting the parametrization; in fact, 


t In our notation, here the first variable of H is the deformation parameter t. 
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is obviously a continuous map h: I > M traversing the image of f followed 
by that of g. We shall call this the product of f and g, denoted f» g. This 
product has the following properties with respect to (relative) homotopy: 


(i) fx (g*h) ~ (fx g)» h. 

(ii) Let f(1)= b = g(0) and suppose f =e,. Then e,*g ~ g. Sim- 
ilarly, if g = e,, then f xe, ~ f. 

(ii) Iff; ~ fand g, ~ gz, then fi * gi ~fa * ga. 

(iv) If g(s)=f(1-— s), and a= f(0), b = f(1), then f* g ~ ea and 
J*f~ ey. 

(v) If F:M>N is continuous and f’=Fof, g=Feog, then 
(fxg) =f *g'. 

The verification of most of these properties is left as an exercise. In each 
case a homotopy H(t, s) having the given properties must be constructed ; as 
a sample we verify (ii). By definition e, * g(s) = b for [0, 3] and e, * g(s) = 
g(2s — 1) for [5, 1]. We define H(s, t) in the following way: 


b, O<s<3(1-—t) and O<t<1, 


H(ts)={ (2s-14¢ 
( 1+t 


) 4(1-t)<s<l. 


It is useful to see how this map H: I x I + M maps various portions of the 
unit square in Fig. VI.10. The shaded portion is mapped onto b = g(0) and 
each horizontal segment in the unshaded part, as for example the dotted line, 
is mapped onto the image of g with the parametrization modified propor- 
tionately. For property (iv) for f * g we have a diagram as in Fig. VI.11 with 
the base mapping according to f » g, the top and two sides mapping onto a 
and the top and two sides of each inner square going onto f(4(1 — t)) = 
g(3(1 + t)). In verifying properties (i), (ii), and (iv) (Exercise 7) such diagrams 
are useful. 

We are now ready to give an example of a group “assigned” to a mani- 
fold M, the fundamental group of M (at the basepoint b). It is an important 


(0,0) g "0 $ (0,0) f b g (1,0) s 
Figure VI.10 Figure VI.11 
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algebraic invariant of a topological space or manifold and is often called the 
Poincaré group after one of the founders of algebraic topology. 


(6.3) Theorem Let 7,(M,b) denote the homotopy classes of all loops at 
be M. Then n,(M, b) is a group with product [f][g] = [fx g]. If F: M > N is 
continuous, then F determines a homomorphism F ,: n,(M, b) > 7,(N, F(b)) 
by F,[f]=[F° f]. If G is homotopic to F relative to the pairs (M, b) and 
(N, F(b)), then F, = G, . Finally, when F is the identity mapping on M, F, 
is the identity isomorphism, and (F > G), = F, ° G, for compositions of con- 
tinuous mappings. 


Proof The product is well defined since (iii) above assures us that 
[f « g] is independent of the representatives f and g chosen from [f] and [g]. 
By (i) it is associative; and (ii) and (iv) give the existence of an identity [e,] 
and inverse. Thus z,(M, b) is a group. It follows from (v) that F: M > N 
induces a homomorphism F,, and the last statement of the theorem is 
immediate from the definitions. Finally if H: M x I > N is a homotopy of F 
and G, then A(t, s) = H(f (s), t) is a homotopy of the loops F ° f and G- f; 
hence F,[f] = G,L/]. I 


We have some immediate corollaries, the first of which spells out the 
meaning of the statement that the fundamental group is a topological 
“invariant.” 


(6.4) Corollary If M, and M, are homeomorphic by F: M, > M, and 
F(b,) = b, then F „ is an isomorphism of the corresponding fundamental groups 
74(M,, by) = n(M2, b2). 


Proof If F:M,-—M, is the homeomorphism and G: M, > M, its 
inverse, then F, and G, are isomorphisms, since F, o G, and G, ° F, are the 
identity isomorphisms by the last statement of the theorem. | 


If the identity map of M to M is homotopic to the constant map of M 
onto one of its points b, then M is said to be contractible (to b). For example, 
any open subset of R” which is star-shaped with respect to a point b is 
contractible since H(x, t) = (1 — t)x + tb is such a homotopy. For contrac- 
tible spaces we have the following: 


(6.5) Corollary If M is contractible to b, then n,(M, b) = {e}, the identity 
element alone. It follows that M is simply connected. 


Proof If fis a loop at b, then it is homotopic to the constant loop e, by 
A(t, s) = H({(s),}). This shows that 2,(M,b) = {1}. To deduce simple 
connectedness from this is exactly Exercise 1. It is even simpler to prove it 
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directly from the definition using again the mapping H. Of course, there are 
simply connected spaces which are not contractible, the sphere S”, n > 1, 
being the simplest example (see Corollary 7.14 below). i 


An interesting application of these ideas arises when we consider line 
integrals along piecewise differentiable paths on M. Let w be a one-form on 
M and suppose p, qe M. If S,, S are two such paths of M from p to q it is 
natural to ask whether or not 


One knows that in general they are not equal, even in very simple cases (see 
Exercise 10). However, the standard theorems of advanced calculus on 
independence of path may be generalized to manifolds with essentially the 
same proofs. We shall state the results and sketch the proofs. 


(6.6) Theorem Let w be a one-form on a manifold M such that dw = 0 
everywhere, and let S, S, be homotopic piecewise differentiable paths from 
pe M to qe M. Then 

| o=f o 

Proof (in outline) If S, and S, are C! curves homotopic by a differ- 
entiable mapping H of I x I into M, then this result is a straightforward 
application of Theorem 5.7 (Stokes’s theorem for the unit square. In the 
general case the (continuous) homotopy H of the piecewise differentiable 
curves must be altered as follows. First J x I is subdivided by vertical and 
horizontal lines (Fig. VI.12) so that it is differentiable on each boundary 
segment and so that H carries each subrectangle Q;; into a single coordinate 
neighborhood U. Then the techniques of Theorem V.4.8 are used to alter H 
successively to a homotopy A which is differentiable on each Q;;. From this 
point the proof follows the usual one of advanced calculus. The argument 
is as follows: 

The new homotopy Ĥ maps the edges of the square Q = I x I into the 
paths S4, q, — S2, p, respectively, as we go around ĝQ counterclockwise. The 
images of the left and right vertical edges are the constant paths p and q. (See 
Fig. VI.12.) Since the line integral of œw over a constant path is zero, we have 


| fre = je + I? = \ — |e 


On the other hand, it is easy to check that if we denote the oriented squares 
of the subdivision by Q;;, then line integrals over the same path in opposite 
directions cancel out, and we have 


| Ato => [ A*o. 


oQ i, j *êQij 
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Figure V1.12 


By Theorem 5.7 and the remarks preceding it, 
| Ato = | d*w. 
Qij Qij 
Since dH*w = H* dw = 0, we see that fs, — fs, œ = 0, which was to be 
proved. | 


Just as in the case of domains of R", this theorem has the following 
corollary: 


(6.7) Corollary Let w be a C” one-form on a simply connected manifold M 
and suppose that dw = 0 everywhere. Then there is a C® function fon M such 
that w = df. If f and g are two such functions, then f — g is constant. 


Proof We choose a fixed basepoint be M and define f at any pe M by 
choosing a piecewise differentiable curve S from b to p and setting 
f(p) = Js œ. Theorem 6.6 assures us that this defines a function on M. The 
remainder of the proof deals with purely local properties; we must show that 
fis a C” function with the property that df = œw. If we show the latter fact, it 
will follow that fis C” because we have assumed œ to be C”. Changing the 
basepoint changes f by an additive constant—the integral of along the 
path between the old and new basepoints—hence does not change df at all; 
therefore it is enough to show that df = w at the basepoint. Let U, ọ be a 
coordinate neighborhood of the basepoint b. We suppose that xt, ..., X" are 
the local coordinates and that g(b) = (0, ..., 0) and ọ(U) = Bi (0) and we let 
f(x}, ..., x") denote the expression for fin local coordinates. Then denoting 
œ in local coordinates by œw = a(x) dx! +- + a,(x) dx", we have, by 
definition f 

F(x) = f a(x) dxt + + anfa) dx", 
C 


the line integral along any path C from (0,..., 0) to (x*, ..., x"). We must 
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show that Of/6x! = a,, j = 1,..., n, at x = (0,..., 0). However, this is im- 
mediate from the definitions: 


(2). = tim 5 (A(O 0-0-4) — f(0,...,0)) 


ex! 


1 ,f . ; 
= lim- | a;(0,..., x4... 0) dx 
ps0 fF 0 


= a,(0, ..., 0). 


This completes the proof, except for the last statement, which is obvious: 
d(f — g) = w — w = 0 so that f — g = constant on the (connected) mani- 
fold M. 


We remark that in terms of the expression of œ in local coordinates, 
dw = 0 is equivalent to (dx,/dx’) — (da,/6x') = 0, 1 < i, j < n. For one- 
forms on R”, this is usually stated by saying that the curl of the vector field 
a, (6/6x') + +++ + x,(0/3x") associated to w vanishes (see Apostol [1]). 

The concept of fundamental group and the techniques of this section are 
intimately related to the notion of covering manifold and of properly discon- 
tinuous group action on a manifold and will allow us to complete the study 
of these phenomena, begun at the end of Chapter HI. We will do this in the 
last section of this chapter. 


Exercises 


1. Prove that a necessary and sufficient condition that every closed curve 
in a connected space M be continuously deformable to a point is that 
m,(M, b) = 1 for some be M. 

2. Let a, b be points of a connected manifold M and show that z,(M, a) 
and z,(M, b) are isomorphic. 

3. Show that 2,(MxN,(a,b)) is naturally isomorphic to 
m,(M, a) x 1,(N, b). 

4. Show that 2,(S!) = Zand 2,(S?) = {1}. Use this to show that $? and T? 
are not homeomorphic. [To show 2,(S”) is trivial one must show as a 
first step that any loop at N, the north pole, is homotopic to one 
contained in the punctured sphere $? — {x} (where x is a point distinct 
from N).] 

5. Show that if G is a connected Lie group, then z, (G) is Abelian. Use e as 
basepoint. 

6. IfF,G:M — N are continuous maps of C” manifolds, F is homotopic 
to G, and F(p) = q = G(p); then F, (7, (M, p)) and G, (,(M, p)) are 
conjugate subgroups of z,(N, q). [Note: We do not assume that the 
homotopy H of F and G is constant on p.] 


PDF pdfFactory Pro www.fineprint.cn 


274 VI INTEGRATION ON MANIFOLDS 


7. Verify properties (i)-(v) used in the proof of Theorem 6.3. 

8. Prove that if we use only piecewise differentiable paths and piecewise 
differentiable homotopies to define z,(M, b), the group obtained is the 
same as when continuous paths and homotopies are used. 

9. Show that C® homotopy of C® mappings of M to N, manifolds 
(without boundary), is an equivalence relation on the set of such 
mappings. 

10. Show that on R? the following integrals depend on the path chosen: 
(a) the line integral of œ = y dx + dy from p = (0,0) to q = (1, 1), 
(b) the line integral of œ = (—y/r?) dx + (x/r?) dy, r? = x? + y? [in 
the latter case, we exclude (0, 0)]. 
In either case is dw = 0? 


7 Some Applications of Differential Forms. 
The de Rham Groups 


It is our purpose in this section to obtain a few results about manifolds 
which are traditionally in the domain of algebraic topology. We do not 
assume a knowledge of this subject, but we have mentioned briefly in the 
Exercises to Section V.8 the following definitions, with some consequences 
which follow from the results of that section. 


(7.1) Definition A k-form œ on a manifold M (with possibly nonempty 
boundary) is said to be closed if dw = 0 everywhere and is said to be exact if 
there is a (k — 1)-form y such that dy = œ. 


We recall some facts about the operator d and apply them here. Let 
Z*(M) denote the closed k-forms on M; since Z*(M) is the kernel of the 
homomorphism d: /\(M) > /\**1(M) it is a linear subspace of /\*(M). 
Similarly the exact k-forms B*(M) are the image of d: /\"~'(M) > /\“(M) 
and thus a linear subspace. Moreover d? = 0 implies that B“(M) c Z*(M) 
which allows us to form the quotient H*(M). 


(7.2) Definition The quotient space H*(M) = Z*(M)/B*(M) is called the 
de Rham group of dimension k of M. If n = dim M, we denote by H*(M) the 
direct sum 


H*(M) = H°(M) @ = @ H"(M). 


Note that H*(M) = Z(M)/B(M), where Z(M) and B(M) are the kernel 
and image of d: N(M) > /\(M), respectively (and the direct sums of the 
Z*(M) and B*(M), k = 0, 1, ..., n). Although called de Rham groups, H*(M), 
k =0,...,n = dim M, are actually vector spaces over R and, in fact, it is 
easy to verify that H*(M) is an algebra with the multiplication being that 
naturally induced by the exterior product of differential forms. This follows 
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directly from the basic property of d which asserts that when ge /\"(M), 
we /\*(M), then 
Up rw) =dornw + (-lleady 


(from which it follows that Z(M) is an algebra containing B(M) as an ideal). 
The great importance of the de Rham groups and the de Rham algebra 
H*(M) stems from de Rham’s theorem, which we state in somewhat 
abbreviated and imprecise form, for its general interest. 


(7.3) Theorem There is a natural isomorphism of H*(M) and the cohomo- 
logy ring of M under which H*(M) corresponds to the kth cohomology group. 


Since we do not assume any knowledge of algebraic topology the coho- 
mology groups will not be defined here, nor will any proof of this important 
theorem be attempted. The reader is referred to Warner [1] and the refer- 
ences found there. We do remark, however, that among the consequences are 
the facts that whenever M is compact the dimension of H*(M) is finite and 
that in any case H*(M) together with its structure as an algebra are topolo- 
gically invariant, that is, if M, and M, are homeomorphic, then H*(M,) and 
H*(M,) are isomorphic as algebras. Finally we mention that the duality 
which appears in algebraic topology between homology and cohomology 
groups of a space extends to a duality of homology groups and de Rham 
groups via integration and Stokes’s theorem—a further motivation for the 
earlier sections of this chapter. Of particular interest to us is the fact that we 
can use these de Rham groups, without using algebraic topology, to obtain 
interesting results about manifolds. Samples are given below and in the 
following section. 

It is a basic property of differential forms that a C® mapping 
F: M, =M, defines a corresponding homomorphism F*: /\(M,) > 
/\(M,). Since F*d=dF*, it follows that F*(Z*(M2)) = Z(M,) and 
F*(B‘(M,)) = B‘(M,). Therefore F* induces a homomorphism, which we 
also denote by F*, of H*(M,) into H*(M,). Since F* is an algebra homomor- 
phism on forms, F*: H*(M,) > H*(M,) is also an algebra homomorphism. 
In summary, with the above notation we have the following lemma: 


(7.4) Lemma 4 C” mapping F: M, > M, induces an algebra homomor- 
phism F*: H*(M;) > H*(M,) which carries H*(M>) (linearly) into H*(M,) 
for all k. If F is the identity mapping on M, then F*: H*(M) + H*(M) is the 
identity isomorphism. Under composition of mappings we have 
(Go F)* = F* o G*. 


Using this lemma, we can obtain a weak version of the invariance of 
H*(M) under homeomorphism mentioned above. 
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(7.5) Corollary If M, and M, are diffeomorphic manifolds, then H *(M,) 
and (H*(M_,) are isomorphic as algebras. 


Proof Let F: M, > M, bea diffeomorphism and F`! its inverse. Then 
F-'* o F* = (F o F~!)* and F* o F7'* = (F`! o F)* are both the identity 
isomorphism, hence F* is an isomorphism with inverse F~ e i 


Although the groups H*(M) are difficult to compute using only the tools 
which we have available, which do not include algebraic topology, we can 
obtain information in special cases—information which we can then use in 
some applications. 


(7.6) Theorem Let M be a C® manifold with a finite number r of compo- 
nents. Then H°(M) = V", a vector space over R of dimension r. 


Proof /\°(M) consists of C*-functions on M and Z°(M) of those func- 
tions f for which df = 0. There are no forms of dimension less than zero so 
B°(M) = {0} and H°(M) = Z°(M). We have seen previously that df = 0 if 


and only if f is constant on each component M,,...,M,. Thus H°(M) = 
{(a;, ...,a,): a;€ R}, where (a,, ..., a,) corresponds to the function taking the 
constant value a; on M;, i= 1,...,r. | 


(7.7) Remark For M = {p}, a point, /)\“(M) = {0} if k # 0 and /A\°(M) = 
R. We define dw = 0 for all forms œ. It follows that H°({p}) = R, {p} being a 
zero-dimensional manifold; this determines the de Rham groups of a point 
space—since /\*({p}) = 0, H*({p}) = 0 for k > 0. 


As an immediate consequence of Corollary 6.7 and Theorem 7.6, we have 
the following theorem: 


(7.8) Theorem If M is a simply connected manifold, then H°(M) = R and 
H'(M) = {0}. 


Proof Suppose w is a closed one-form on M, that is, dw = 0. Then there 
exists a function f on M such that df = w, thus q is exact. Since every closed 
one-form is exact, H'(M) = {0}. H°(M) = R since M is connected. I 


In addition to this information concerning H°(M) and H'(M) we may 
also prove the following statement concerning the highest-dimensional 
de Rham group H"(M), n = dim M. 


(7.9) Theorem Let M be a compact orientable manifold of dimension n with 


ôM = Ø. Then there is a natural homomorphism of H"(M) onto R. In 
particular, H"(M) # {0} 
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Proof Since dim M = n, every n-form on M is closed; hence Z"(M) = 
/\(M). Define a homomorphism h: Z(M)->R by h(Q)=fyQ. If 
Qe B(M), then Q = dw for some we /\""'(M). Since 3M = Ø, it follows 
from Stokes’s theorem that 


KO = ndo) = | do = f wo =0. 
M ôM 


Therefore h determines a homomorphism h’: H"(M) > R. Let Q, be a volume 
element on M. Then A(Q%) = fm Qo # 0 by Theorem 2.2. Thus, by suitable 
choice of a constant c, we can make h(cQ)) = c fu Qo take any real number as 
value. Hence h and k’ are onto. | 


The Homotopy Operator 


In order to obtain some further results concerning de Rham groups we 
will introduce a special operator .¥, the homotopy operator. Let A c R" be 
such that A is the closure of the interior of A. We have in mind regular 
domains, cubes, simplices, or their interiors. Note that J x A is the closure of 
an open set, its own interior, in R x R” = R"*!. When A is not open, a C” 
k-form w on A is the restriction to A of a k-form @ on an open set U > A—by 
definition of differentiability of functions (in this instance its components) on 
A. Our restrictions on A ensure that all derivatives of any C” function f on A 
are defined at every pe A independently of the open set U and extension f 
which may be needed to define them at boundary points. This is a conse- 
quence of the continuity of all derivatives of J on U and of the fact that every 
péA is either an interior point—where the derivatives are already defined 
without any f—or the limit of interior points. It follows that for a C” form œ 
on A, dw is defined, even at boundary points. 


(7.10) Definition The homotopy operator # is defined to be an R-linear 
operator from /\**'(I x A) => /\*(A). On monomials -¥ is defined as fol- 
lows: If w= a(t, x)dx''a---vdx'**!,| we set %m=0; and if 
w = a(t, x) dt a dx" a+: a dx'*, we define fw by 


1 
IO = (i a(t, x) i) axa ada 
0 


Having been thus defined for monomials, we extend .¥ to be R-linear on 
At H(I x A) with values in /\*(A). 


We will denote by i: A >I x A the natural injection i,(x) = (t, x) and 


then œw, will denote i*w; in particular œ = iœ and œ, = i*@. With these 
definitions and notations we find that .¥ has the following basic properties. 
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(7.11) Lemma The homotopy operator : /\** "(I x A)> /\\(A) in addi- 
tion to being R-linear has the following properties: 


(i) it commutes with C® functions which are independent of t; 
(ii) for all we /\K**(I x A) it satisfies the relation 


S dw + dfw = w, — wo. 


Proof If fis independent of t, we may consider it both as a function on 
I x Aand on A. Since it is independent of t it can be moved through the 
integral sign in the definition of £, thus £ fw = f. 4a. 

For the second property we must verify the equation directly; it is 
enough to do so for monomials since d, %, if, and if are all R-linear. First 
we consider the case where w does not involve dt, in other words 
w = a(t, x) dxa a dx**!, Then £m = 0 so that dfw = 0 and F dw is 
given by 


1 
I dw = (i d) dxit aa dx®* = (a(1, x) — a(0, x)) dx as a dx. 
0 


But the right side is then exactly if — i§@ = w, — wo, which thereby 
establishes the equality for this case. 

Now suppose that œ = a(t, x) dt a dxi! a a dx. Computing ¥ dw, we 
see that 


n 1 fad , , , 
S dw = — — dt) dxi n dx" A+++ a dx". 
i p> ( o Ox! ) ^ i 

On the other hand using the Leibniz rule to differentiate under the integral 
sign (see Exercise 1.6), we may compute d.f w: 

1 

dfw = a(| a(t, x) an) dxit a+ a dxit 
0 
= 16 : : ; 

=}, (i, sit dxi adx"! att A dx". 
Adding these expressions we see that £ dw + dfw = 0. On the other hand 
since it dt = 0 = iš dt, we have 0 = ifw — ifm = w, — Wo. Thus in all 
cases (ii) holds. | 


We continue to denote by A a subset of R" which is either open or is the 
closure of an open set, and we use Lemma 7.11 to obtain Poincaré’s lemma. 


(7.12) Lemma If A is star-shaped, then H*(A) = {0} for all k > 1. Hence 
H*(A) is isomorphic to the cohomology ring of a point. In particular, this holds 
for A= R’". 
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Proof We recall that A is star-shaped if it contains a point 0 such that 
for any pe A, the segment Op lies entirely in 4. By suitable choice of coordin- 
ates we may suppose that 0 is the origin. We define H: I x A > A as 


H(t, xt, ..., x") = Oca tX"). 


If w is a k-form on A, then H*% is a k-form on I x A. In the definition of .7 
we defined ig: x > (0, x) and i,: x —> (1, x); therefore H < ig: A — {0} and 
H °i,: A > A is the identity. For k > 0, i§(H*@) = 0 since /\*({0}) = 0 for a 
point space {0}. Thus, 


dI(H*w) + I d(H*w) = i¥(H*w), 
so that if dw = 0, then dH*%œ = 0 and 
d.f H*w = (H° i )*® =o. 


Therefore every closed k-form w on 4 is exact if k > 1. If k = 0, then we may 
use the fact that A is connected to see that H9(A)= R. | 


In fact, the homotopy operator .¥ can be defined and used under more 
general hypotheses. We have supposed that 4 is a particular type of subset of 
R", but it is possible to extend the definition to manifolds with or without 
boundary. As a sample we shall prove the following theorem. 


(7.13) Theorem Let M and N be C® manifolds and assume 0M = Ø. 
Suppose that F and G are C® mappings of M into N which are C” homotopic. 
Then the corresponding homomorphisms F* and G* of H*(N) into H*(M) are 
equal. 


Proof We shall use our previously defined operator .¥ to construct a 
similar operator .#: /\** (I x M) > /\*(M). First we note that M may be 
covered by a collection of coordinate neighborhoods, U;, g; with g,(U;) = 
Bi(0), n = dim M and i= 1, 2,..., for which we have a subordinate C” 
partition of unity { f;}, supp f; = U;. Then any (k + 1)-form œ on I x M can 
be written as a sum of forms with support in I x U;, 


fio on U;, 
o=} a; w; = 
0 on M — supp f,. 


This sum is, of course, finite on a neighborhood of each pe M. We may 
consider f;, or any functions on M, as being also functions on I x M which 
are independent of t. We define £ to be additive so that fw = $. ¥«;, which 
leaves only the problem of defining .4 on forms with support in one of the 
neighborhoods I x U;. 

When w has support in a neighborhood J x U, where U, 9 is a coordin- 
ate neighborhood with @(U)= Bi(0), we proceed as follows. Let 
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go: 1 x U —> I x Bi (0) be defined by g(t, p) = (t, y(p)). Then define fw on 
I x U, using our previous definition of # for I x Bi(0), by %a|u= 
*(4(@ *w)), the J on the right side being the operator defined earlier, 
and further, let œw = 0 on M — U. This defines a C* k-form on M, the 
image ofa (k + 1)-form on 7 x M. By Lemma 7.11 for this form œw we have the 
relation sdo + dfw = w, — wo. Now since Jd + d¥ is an additive oper- 
ator, then for an arbitrary we /\**"(I x M) we may apply the decomposi- 
tion w = )) œ; to obtain 


S do + dfo =d] w; + dF Yow; 
=F F dw, + ¥ dfo; = } ((@)1 — (io) 
= WM, — Wo. 
Finally, to complete the proof we let w be any closed k-form on N and we 
must show that G*w — F*w is exact. Now let H:I x M > N be the homo- 


topy connecting F and G. Then F(p) = H(0, p) = H » iy and G(p) = H(1, p) = 
H oi,, where i,(p) = (t, p) as before. Since dH*w = H* dw = 0, we have then, 


df H*%w = i*H*w — iğH*w = Gow — F*o, 
i 0 
as was to be shown. I 


We say that a manifold M is C” contractible if there is a C” homotopy 
H:I x M—M of the identity and a constant map, i.e., for all ge M, 
H(0, 4) = q and H(1, q) = p, p fixed. 


(7.14) Corollary If M is C® contractible, then it has the cohomology of a 
point space: H*(M) = {0} for k > 0 and H°(M) = R. In particular, a compact, 
orientable manifold is not C® contractible. 

Proof This follows from the theorem with M = N and Remark 7.7. For 


the last statement, if M were C” contractible it would contradict Theorem 
7.9. | 


(7.15) Remark It is not difficult using theorems like Theorem V.4.8 to 
extend many of the results here and below to the case of continuous maps and 
homotopies. 


Exercises 


1. Let œ be a closed one-form on a compact manifold M. We define a 
mapping F.,: 7,(M, b) > R by the following method. Let f: [0, 1] + M 
be a piecewise C! loop S at b and denote by [f] the corresponding 
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element of 2,(M, b). We define F,,([f]) = fs œ. Show that F,, isa homo- 
morphism whose kernel contains the commutator subgroup of the fun- 
damental group. 

2. Compute H*(S') and H*(M). where M is a Möbius band. 

3. Let M be a manifold, N a submanifold of M, and R: M >N a C* 
mapping which leaves N pointwise fixed. Show that R*: H*(N) > 
H*(M) is an injective homomorphism. 

4. Let M=M, x M, and let P;: M > M; be the natural projections. 
Show that P*: H*(M,) > H*(M) is an injective homomorphism. 

5. Let A.(M)< AM) denote the subalgebra of differential forms of 
compact support. Show that H*(M) may be defined analogously to 
H*(M) and is naturally isomorphic to a subalgebra. Prove that if M is 
connected and not compact, then H°(M) = {0}. 


8 Some Further Applications of de Rham Groups 


In this section we use the information on de Rham groups accumulated 
in the previous section to obtain some very interesting facts which one 
customarily demonstrates by use of algebraic topology. In particular, it is 
proved here that there is no vector field on S$? which does not vanish at some 
point—a fact to which we have frequently alluded. In a very beautiful little 
book, Milnor [2] has shown how many purely topological results can be 
obtained by differentiable methods. Using this idea and the results of 
Section 7, we give a demonstration of the Brouwer fixed point theorem, 
following the author’s note (Boothby [1]). 

We begin our proof of this last mentioned theorem by establishing a 
lemma. Let D” denote B7(0), the closed unit ball in R”. Then D” is a manifold 
with boundary, ôD" = S"~*. (For another proof, see Milnor [6].) 


(8.1) Lemma There is no C* map F: D" > ĉD" which leaves CD" pointwise 
fixed. 


Proof Suppose that there existed such a map F and let G denote the 
identity map of êD" > D". Then F < G = I, the identity map of ¢D" > ¢D". 
This implies that G* + F* = (F o G)* induces the identity isomorphism on 
H*(2D"). Therefore the homomorphism F*: H"~ '(¢D") > H"~ '(D") must be 
injective, that is, ker F* = {0}. Since H"~'(D") = {0} by Poincaré’s lemma, it 
follows ker F* = H*` '(@D") and therefore that H"~'(CD") = {0}. However, 
because ĉD" = S"~', an orientable and compact manifold without bound- 
ary, Theorem 7.9 shows that H"~?(@D") = H"~'(S"~') 4 {0}. This contra- 
diction implies that no such map F exists. i 


Using this lemma we may establish (as in Milnor [2]) a very well-known 
theorem of algebraic topology, the Brouwer fixed point theorem. 
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(8.2) Theorem (Brouwer) Let X bea topological space homeomorphic to 
D”. Then any continuous map F: X > X has a fixed point, that is, for each F 
there is at least one xE X such that F(xo) = Xo. 


Proof Asa first step we note that it is enough to prove the theorem for 
D". Let H: D" » X bea homeomorphism, and let F: X > X be any contin- 
uous mapping. If H`'» Fo H:D"— bD” has a fixed point yo, then 
xo = H(yo) is fixed by F. 

Moreover, even in the case of D”, it is enough to establish the property 
for C” maps F: D” > D". To see this we suppose every such C” map has a 
fixed point and assume there is some continuous map G: D” > D” which has 
no fixed point. Then |G(x) — x|| is bounded away from zero on the 
(compact) set D", and we may find an £ > 0 such that ||G(x) — x|| > 3e. 
Using the Weierstrass approximation theorem (Lemma V.4.9), or the 
approximation theorem given in Exercise 2, we approximate G to within € 
by a C® mapping G,; then |/G(x) — G,(x)|| < e for all xe D". However, 
since the values G,(x) are not necessarily in D” for every x € D”, we replace 
G, by F(x) = (1 + e) +G,(x). Clearly F(x) is defined and C% on D” and 
F(D") < D". Since ||G(x)|| < 1, it follows that [G(x < 1+ ¢ and 
|| F(x) < 1 for all xe D”. Thus F maps D” into D" and is C®. For xe D”, 


| G(x) — F(x) = |G) — (1 + 2) Gis) 
(1 +e)” *|eG(x) + G(x) — Gilx)| 
el|G(x)|| + G(x) — Gi (x)| = 2. 


From these inequalities we obtain a contradiction to the assumption that 
every C” map F: D" > D" leaves some point fixed. Namely, for every x € D” 
we have 


lA 


F(x) — xl] = (GO) — x) = (Gx) — FR) 
= ||G(x) — x| — G(x) — Fœ) 
> 3e — 2e = e. 
This contradiction shows that if every C” map of D" to D" has a fixed point, 


then so must every continuous one. The proof of the theorem is then 
completed by the following lemma. I 


(8.3) Lemma If F: D" > D" is a C* map, then F has a fixed point. 


Proof This is again a proof by contradiction. We suppose there exists 
an F: D” > D" which is C” and has no fixed point. We shall use F to 
construct a C” map from F: D" > 6D" which leaves 6D" pointwise fixed. 
Namely, given x € D”, let F(x) be the boundary point obtained by extending 
the segment F(x)x past x to the boundary of D” (Fig. VI.13). In particular, if 
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Fix)=x+hu 


Figure VI.13 


x €cD", then F(x) = x and, in any case, F(D") c €D". To see that F is C® we 
express F explicitly using vector notation in R”. Namely, F(x) = x + Au, 
where x denotes the vector from (0, ..., 0) to x = (xt, ..., x"), u is the unit 
vector directed from F(x) to x and lying on this segment, more precisely, 


_ *— Fe) 
(fx — F(x) 


and where 2 = — (x, u) + [1 — (x, x) + (x, u)?]'”? denotes the length of the 
vector on u with initial point x and terminal point F(x) on ĉD”. Since F is 
C”, it is easy to check that F is C*. Figure VI.13 represents the intersection 
of D” with the 2-plane determined by three points: the origin 0, x, and F(x). 
The scalar Ż is the unique nonnegative number such that |x + dul] = 1. 
Since F is C*, u is C*, so wherever the expression under the radical is 
positive, then F(x) is also C*. However, 1 — (x, x) > 0 with equality only if 
xéS"~*; and (x, u} > 0 with equality only when u is orthogonal to x, that 
is, when x — F(x) is orthogonal to x. However, (x, u) = 0 cannot occur 
when (x, x) = 1, that is, on a point of S"~’, since in this case F(x) would be 
exterior to D”. Thus 1 — (x, x) + (x, u}? > 0 on D" and F is C*. The exist- 
ence of F contradicts Lemma 8.1, so F has a fixed point, which completes the 
proof of Brouwer’s fixed point theorem. i 


u 


As another application of these ideas we prove the following theorem 
concerning the antipodal map A(x) = —x on the unit sphere $”~' of R”. 


(8.4) Theorem [fn is odd, then there is no C” homotopy between the anti- 
podal map A: S"~' + S"~! and the identity map of S"~'. 


Proof The sphere is an orientable manifold, in fact we may define the 
oriented orthonormal frames of T,(S"~') at each xe S"~' in the following 
fashion. Each x e S""! determines a unit vector x = Ox, and the elements 
of T,(S"~') correspond to the vectors in the orthogonal complement of x. If 
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ej, ..-,@,—1 is an orthonormal frame of T,(S"~ 1) in the induced metric of R”, 
then x, e,,...,@,-; is an orthonormal frame of R’—we use the natural 
parallelism to identify vectors at distinct points of R". Two frames, 
€1,-+-5@,—; and ej, ..., €41, at x will be said to have the same orientation if 
the corresponding frames x, e;,...,€,-; and x, e,...,@,-, do. Then from 
the canonical orientation of R” we obtain an orientation of S"~ ' by choosing 
as oriented that class of frames for which x, e;,..., €, 1 is an oriented frame 
of R”. This is exactly the orientation 6D” introduced in connection with 
Stokes’s theorem. Let Q be the unique (n — 1)-form on S”~’ which takes the 
value +1 on all oriented orthonormal frames e,,...,€,—. Since A: S""* > 
S"~1 is the restriction to S"~! of a linear, in fact an orthogonal, map of R" its 
Jacobian is constant and just the map 4 itself. Thus under A the frame 


€,,...,€,-1 at x goes to the frame —e,,..., —e,-, at —x. It is clear that this 
will be oriented according to our orientation of S"~' if and only if n is even so 
that x,e,,.-.,e,-, and —x, —e;,..., —e,-, are coherently oriented frames 


of R”. Therefore A*Q = (—1)"Q and when n is odd, Q = — A*Q. 

If there were a C” homotopy connecting A and the identity, then 
Q — A*Q must be exact by Theorem 7.13. Since the integral over S"~* of an 
exact form is zero by Stokes’s theorem, this means that when n is odd, 


2{ Q=[ (@-4*Q)=0. 
“§n-1 varmi 
But this is impossible since the integral over S”~* of the volume element is 
positive. [For an alternative proof see Exercise 3.] | 


We shall deduce two consequences. First, recall that although we have 
discussed orientability of manifolds at some length and have shown that 
some manifolds are orientable, for example, S"7! = 6D", we have never 
presented an example of a manifold which cannot be oriented. We shall 
remedy this omission now. 


(8.5) Corollary Real projective space P"(R) is not orientable when n is even. 


Proof Suppose that it is; we know that $” is a (two-sheeted) covering 
manifold of P"(R) which can thus be obtained from S” as the orbit space of 
the group of two elements Z, acting on S”. This action is obtained by letting 
the generator of Z, correspond to the antipodal map 4 (Example III.8.2). If 
Q is a nowhere vanishing n-form on P"(R) and F: S" > P"(R) is the covering 
map, then F*Q = Q* is a nowhere vanishing n-form on S". Moreover since 
Fo» A= F we see that A*Q* = Q*, which is not possible since A reverses 
orientation if n + t is odd. Thus P"(R) is not orientable when n is even. ff 


(8.6) Theorem [fn is even, then there does not exist a C” -vector field X on 
S" which is not zero at some point. 
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Proof We suppose that such a vector field exists and show that this 
implies that the antipodal map A and the identity map J on S$" are C” 
homotopic. Let X be a C*-vector field on S” such that X is never zero. Then 
X/|X|| is a C®-vector field of unit vectors (we use the induced metric of 
R"* 1) so we may suppose to begin with that |X|! = 1 on S" If x is a point of 
S”, let X, be the corresponding vector of the field. Treating R"*?! as a vector 
space and thinking of x as a radius vector, we have (x, X,) = 0 for every x, 
and we define the homotopy H: S" x I > S" by 


H(x, t) = (cos nt)x + (sin nt)X,. 


Then A(x, t) is C” and, since ||H(x, t)|| = 1, it defines a map of S” — S” for 
each t. Thus H(x, 0) = x and H(x, 1) = — x as claimed. However, the exist- 
ence of such a homotopy when n is even contradicts the previous proposi- 
tion; therefore in this case no such vector field exists. | 


(8.7) Remark We have noted previously that when n is odd, then the 


vector field X, assigning to x = (x', x*,..., x", x"*!)e S” the unit vector 
ô ô ô ô 
oe S E T ats RHL | 
Hae ax  * Qa tot ox" X gami 


orthogonal to x defines a nowhere vanishing field of tangent vectors to $”. It 
follows that in this case A is homotopic to the identity. 


The de Rham Groups of Lie Groups 


We shall briefly touch on a special case of considerable interest in the 
theory of de Rham groups; in fact it is the case which may have led to their 
discovery. We suppose that G is a compact connected Lie group, for example 
SO(n), and that 0: G x M > M is an action of G ona compact manifold M. 
As usual 0, denotes the diffeomorphism of M defined by 0,(p) = 8(g, p). A 
covariant tensor @ on M, in particular an exterior differential form, is said to 
be invariant if Ož = ¢ for each g e G. Since d(@*p) = 0*(dq) for every form 

o (Theorem V.5.2), we see that if g is invariant, dọ is also. Let AM ) denote 


e subspace of 1 ) which consists of all invariant k- ae Then 
(A v H (= AH a we re just seen; and we may define Z‘(M) = 
{pe a = a and B(M) = d(/\*(M)) = Z*(M), the closed invar- 


iant Ae eg “invariantly bia forms of patel, k. We then make the 
following definition. 


(8.8) Definition The invariant de Rham groups of M, denoted by H*(M), 
are defined by H*(M) = Z*(M)/B*(M). 


We note that the natural inclusion i of A*(M) in /\*(M) takes Z*(M) into 
Z*(M) and B(M) into B‘(M) and hence induces : homomorphism 
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ią: H*(M) > H*(M). In order to study this homomorphism we define an 
R-linear operator P: /\*(M) > /A\(M). If y € /\*(M), then let Q denote the 
bi-invariant volume element for which vol(G) = 1 and define Pe by 


PolX i- X) = | OF (KX 4, «204 Xa) 
G 
This operator has the following properties: 
(8.9) Lemma _# takes a k-form to an invariant k-form, that is, 
P(/\*(M)) = N(M). Moreover, 


(i) if ge A*(M), then Po = 9; 
(ii) dP = Pad. 


Proof It is easy to check that Poe /\*(M) and in fact is G-invariant: 


0*PO(X 1... Xx) = POO an X 15 +++ Bay Xt) = | OF Gay X 14 +--+ Dax XQ 
“G 


= | ox[0%9(X1,..-, X2 = | OF 9(X no XQ 
“G *G 


= | O%9(X,,..., X) 
*G 


The fact that Pp is C® and (ii) are consequences of the Leibniz rule for 
differentiating under the integral sign. If ọ is G-invariant, then 07g = ¢ for 
all ge G, or more precisely at each pe M, 


Ož Pog, p(X ip» > Xkp) = PAX tp» ee X kp) 
From this it follows that 
POX i- Xi) = | OFG(X i -> XQ = GX iy he) | O: 
*G “G 


Since fe Q = 1, Py = ọ and property (i) are established. | 


The lemma leads to the following result for G, M and H*(M) as described 
above: 


(8.10) Theorem The homomorphism i,: H*(M) > H*(M) is an isomor- 
phism into for each k = 0,1,..., dim M. 


Proof Suppose that [@] is an element of H*(M) and that @ is a closed 
invariant form on M belonging to the class [@]. In order to show that i, is 
one-to-one we need only see that if @ = do, o e AYT (M), then @ is the 
image under d of an element of A\*~ ‘(M), that is, that if @ is exact, then it is 
“invariantly exact.” This follows from Lemma 8.9 since 
@ = Po = P do = d(Po) and Poe /\*"*(M). | 
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(8.11) Remark It is also true, but somewhat harder to prove directly, that 
i, is onto, that is, H*(M) is isomorphic to H*(M) for all k. For details the 
reader is referred to Chevalley and Eilenberg [1] or Greub et al. [1]. 


This theorem, coupled with the remark, enables one to reduce to alge- 
braic form many questions about the cohomology of homogeneous spaces. 
In particular, computation of the de Rham groups of such a space can be 
converted to a problem concerning representations of the isotropy subgroup 
of a point. Consideration of these interesting questions would take us too far 
afield, however we touch very briefly on one special case. 


(812) Lemma Let ®, be a covariant tensor of order r on T(G), where G is a 
connected Lie group. If Ad g*®, = ®,, that is, if ©, determines a bi-invariant 
tensor on G, then for any X,,...,X,, Zeg, we have 


r 


AE D Xa X= 


i=1 


Proof Let ® denote the bi-invariant covariant tensor on G determined 
by ®,. Given Zeg, a left-invariant vector field on G, we have seen 
(Sections IV.5 and IV.6) that Z is complete and that the one-parameter 
group action 6: R x G —> G which it determines is given by right transla- 
tions by the elements of a uniquely determined one-parameter subgroup 
g(t) = exp tZ by the formula 0, = R,,,. We have previously (Theorem IV.7.8) 
established the following formula for C*-vector fields on a manifold (in this 
case on G): 

. 1l 
[Z, X], = a t [0 _ ia X op) = X) 
is 
If we suppose that p = e and that X is a left-invariant vector field, then 
[Z, X] is just the product in the Lie algebra g. If we identify g with T.(G), we 
may write 
ea ll 
[Z, X] = lim z Bacos Xan — Xe]. 


t>0 


Since ©® is bi-invariant, Ri» ® — ® = 0; thus for any X,,..., X,€q, 


OR i Neh ge — (Xa... X) =. 


g{— the ~* Ls 


Adding and subtracting ®(X,,...,X;_,, R X Rg(—-peXr» i= 1, 


Gti, we, 
...,¥, then multiplying by 1/t and letting t + 0, we obtain the formula. 


(8.13) Corollary Every bi-invariant exterior form on a Lie group G is 
closed. 
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Proof Let w be an exterior differential r-form. If œ is left-invariant and 
Xi, X>,...,X,+1 are left-invariant, then, using a caret to denote missing 


terms, 
tti 


dX. Xna) D, (DIX OX os Any Xr) 
i=1 


+ F Ell Vo XX Xa Sieg A pete) 

i<j 
We previously established this formula for r= 1 (Lemma V.8.4). The 
method of proof in the general case is the same (compare Exercise V.8.3). 
The first term on the right vanishes since w and X,, j = 1,...,r + 1, are 
left-invariant, the second as a consequence of the lemma. | 


Now we suppose that G acts on itself by both left and right translations. 
More formally, let G = M and let K = G x G, the direct product of Lie 
groups, and define 6: K x M > M for each xe M = Gand k = (g1,9.)€K 
by 

O(k, x) T= 91X92" (=R,;1 s L, 0). 


Then the K-invariant forms @ on G are exactly the bi-invariant forms. 


(8.14) Corollary Each bi-invariant r-form on a compact, connected, Lie 
group G determines a nonzero element of H "(G). 


Proof By Corollary 8.13 each ge A'(G), that is, each bi-invariant 
r-form is closed. We know that if it is exact, then it must be of the form dé 
with @ bi-invariant. But then it is zero, by ihe corollary again, since dé = 0. 


(8.15) Example Consider any compact, connected, non-Abelian Lie group 
G, for example, SO(n), the orthogonal matrix group (with elements of deter- 
minant +1) for n > 3. We claim that H*(G) + {0}. We consider that the 
exterior three-form (X, Y, Z) = ([X, Y], Z) on G; (X, Y) denotes the bi- 
invariant inner product. Since X, Y e g implies that [X, Y] is left-invariant 
and since Ad(g) is an automorphism of g, it follows readily that @ is bi- 
invariant. The alternating property of o follows from [X, Y] = —-[Y; X] 
together with the symmetry of (X, Y). By Corollary 8.14, ọ is closed and if it 
is not zero, it determines an element of H*(G). Suppose that p = 0. Then for 
all X,Y,Zeg, we have ọ(X, Y, Z)= ([X, Y], Z)=0. In particular, 
([X, Y], [X, Y]) = 0 so that [X, Y] = 0 forall X, Y e g. This means, accord- 
ing to Section IV.7, that the one-parameter groups of G commute. It follows 
that there is a neighborhood U of e which consists of commuting elements. 
By the connectedness of G it follows that the elements of U generate G, 
which is therefore commutative, contrary to assumption. This means that @ 
determines a nonvanishing element [¢] of H°(G). 
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Exercises 

1. Let A be a closed subset of the metric space R” and f: A > R a contin- 
uous bounded function. Show that there is a continuous extension of f to 
all of R” (Tietze-Urysohn extension theorem). 

2. Given ô > 0, let g;(x) be a nonnegative C” function on R” such that 
supp gs = B3(0) and fra g3(x) dx! --- dx" = 1. Let fbe a continuous func- 
tion defined on an open subset U and let K be compact with K c U. 
Given £ > 0, choose ô > 0 so that: (i) ô is less than the distance from K 
to R"—U and (ii) for |y|| <ô and xe K, | f(y +x) —f(x)| <e. 
Show that ; 

g(x) = I SO + x)gsly) dy" + dy" 
is a C” function and that | g(x) — f(x)| < eon K. 

3. Prove Theorem 8.4 by using on S”~! the form Q obtained by restricting 
Yao. (117 xt dxt ava dal! a dx!t! a eea dx" to S71, 

4. Show that if n is odd, then P"(R) is orientable. 

5. Show that there exists no continuous vector field on S", n even, which is 
nowhere zero. 

6. Using Remark 8.11, prove that if G is a connected compact Lie group of 
dimension n, then H"(G) = R. 

7. Using Remark 8.11 compute H*(7"), k = 0, 1,..., n. Give a formula for 
dim H*(T") (as a vector space over R). 

8. Define the center ¢ of the Lie algebra g of the compact connected Lie 
group G by ¢ = {Z e g | [Z, X] = 0 YX e g}. Prove that it is a subalgebra 
and that if Z € ¢, then exp tZ is in the center of G. Show that if ¢ = {0}, 
then Ë! (G) = {0}. 


9 Covering Spaces and the Fundamental Group 


The ideas involved in paths, loops, their homotopies, and the fundamen- 
tal group of a manifold M, which were discussed in Section 6 have an inti- 
mate connection with the covering spaces of M and with properly 
discontinuous groups, which were considered much earlier (Sections II.8 
and III.9). Clarifying this relationship will enable us to complete the discus- 
sion of Chapter HI in several important respects. 

Suppose in what follows that M is a manifold, M a covering manifold, 
and F: M > M the (C®) covering mapping. If X is a topological space and 
G: X + M a continuous mapping, then a continuous mapping G: X > M is 
said to cover G if F o G = G; we also say G is a lift of G. For example, if 
fi:I—>M is a path or loop, then f: I + M is a path which covers it if 
F o f(t) = f (t) for0 < t < 1. Ifa covering fofa given path f exists at all, then 
it is uniquely determined by its value on a single point, say by f (0). More 
generally, with the notation above we have the following results. 
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(9.1) Lemma If F:M >M isa covering and X is a connected space, then 
two (continuous) mappings G,, G,: X > M covering a continuous mapping 
G: X —> M agree if they have the same value at a single point xo € X. 


Proof Let A = {xe X: G,(x) = G,(x)}. Then A is closed by continuity 
of G, and G,. A is also open, for if xe A and if U is a neighborhood of 
G,(x )= G(x x) such that F | U is a diffeomorphism of U to M, then it follows 
that G, and G, must agree on the open set V = G;!(U) ^ Gz 1(U). In fact if 
ye V, then F o G,(y) = F - G,(y) by hypothesis; but since G,(y) and G,(y) 
are in U, on which F is one-to-one, they must be equal. Finally since A is not 
empty and X is connected, A = X. | 


(9.2) Theorem Let f: 1 > M be a path in M with initial point b = f (0). If 
F: M > M is a covering and b € F~ ‘(b), then there is a unique path f in M with 
initial point f(0) = Ë which covers f(t). 


Proof Uniqueness is a consequence of the previous proposition. To 
prove existence we suppose 0 = tọ < t <` < t, = 1 is any partition of I 
such that for each i, f ([t;, ti+ı]) lies in an admissible neighborhood V, with 
respect to the covering. The existence of such a partition follows from the 
compactness of J and the continuity of f. We let f(0) = b and let be M 
denote a point over b, that is, F(b) = b. If U, is the unique connected 
component of F~'(V,) containing b, then we define f(t), 0 < t < t, by 
F(t) = (F|U,) ye )). Then f(t,)¢ U, ^ Uz, where U, is the unique com- 
ponent of F~'(V,) containing f(t,). This allows us to define f(t) = 

(F | U2) *(f(t)) for ti < t < t, and thus determine fon [to, t,]. Clearly we 
can continue in this fashion to define f on all of I. (See Fig. VI.14.) | 


It is important to realize that not only a path but even a homotopy of 
paths can be lifted to a covering M of M by this procedure. 


Suppose that f,g:1 > M are paths and H:I x I > Misa homotopy of f 
and g, i.e., H(0,s) = f(s), H(1, s) = g(s), and F: M — M is a covering of M as 
above. 


(9.3) Theorem Given be M such that F(b) = b = f(0) (=H(0,0), then 
there is a uniquely determined homotopy H(t, s) with H(0,0) = 6 which covers 
H(t, s), i.e., such that F 0 Ñ = H. If H leaves fixed the endpoints of f and g, then 
Ñ leaves fixed the endpoints of the paths Ñ, covering H (s) at each stage of the 
homotopy. 


Proof First suppose that H(I x I) c V, an admissible neighborhood on 
M relative to the covering. Let U c M be the unique neighborhood in 
F~*(V) which contains 6. Then F|U:U >V is a diffeomorphism and 
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Figure VI.14 
Lifting a path from T? to its covering R?. 


= (F|U)~! > H is the unique lift of H such that Ai(0, 0) = B. In the general 
case we choose £ = 1/k sufficiently small so that by partitioning I x J into k? 
squares of side g, i.e., partitioning by lines t = i/k, s = j/k, i,j =0,1..... k, we 
ensure that each square Q;; = {(t, s)|i/k < t < (i + 1)/k, j/k < s < (j + 1)/k}, 
0 <i,j <k — 1, lies in an admissible neighborhood V;;. This is possible since 
H(I x I) is compact. We then proceed as follows: Define foo: Qoo > M so 
that Ho9(0, 0) = B and F o Foo = H| Qoo, using the method above. This lift is 
unique and defines Ho,(0, 1/k) so we may define Ñ.: Qo; > M so that 
Fly, (0, 1/k) = Ho9(0, 1/k) and F o o Hy, = H|Qo,. By uniqueness, Foo = Ño 
on their common boundary Qoo © Qoi- Continuing in this way we define 
Foo, Ĥo, n nfor- 1° 

This defines a unique continuous lift Ñ, of H restricted to the strip 
{(t,s)|0<t < 1/k, O< s <1}, with Ñ (0,0) = 6. We now continue this 
process on the next row of squares: Q10,- - -, Qix- p etc. (see Fig. VI.15). After 
a finite number of steps, a ar mapping Ñ: I x I > M is defined such 
that (0,0) = 6 and FoH =H. 

If the endpoints alte fixed at all stages of the homotopy H, then we 
must have H(t, 0) = b and H(t, 1) = c, common initial and terminal | points of 
f,g. Since F o A(t,0) = b, it follows that the continuous path t > A(t, 0) lies 
in the discrete set F~‘(b). Hence it must be a single point—in this case b. 
Similarly, A(t, 1) is a single point lying over c. 
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Jik Of tlk s 


Figure VI.15 
The next step extends the lift A of H from shaded squares to Q,,. 


(9.4) Corollary If be M lies over be M, then F,: n,(M, 6) + n,(M, b) is 
an injective isomorphism. 


Proof We know F, is a homomorphism and, using the previous 
theorem with f; 9 loops at b, we see that F > f~ F o g implies f ~ g. This is 
equivalent to F, being injective. | 


We conclude this section by proving two theorems which give a much 
more precise picture of the relation between coverings of a manifold M and 
the fundamental group. They will also enable us to complete our discussion 
of the relation between covering spaces and orbit spaces of a properly 
discontinuous action of a grcup [I on a manifold, which was considered in 
Section II.9. If M, and M, are coverings of a manifold M with covering 
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maps F, and F,, then a homeomorphism G:M,—M™, such that 
F, = F, o Gand F, = F, © G7! is called an isomorphism of the coverings. In 
particular, an automorphism, that is, isomorphism, G; M —> M is exactly a 
covering transformation, as given in Definition III.9.2. Using admissible 
neighborhoods, it is apparent that the differentiability of F, and F, implies 
that of G and G~'. We now show that in a sense isomorphism classes of 
coverings of M are in one-to-one correspondence with subgroups of the 
fundamental group. 


(9.5) Theorem Let F,: M, >M and F,:M,—M be coverings of the 
same manifold M. Suppose that for be M, b, e M, and b,€M, such that 
F,(6,) = b = F2(b,) we have F,,7,(M,,6,) = Fap ni (M3, b2) [as sub- 
groups of ™,(M, b)]. Then there is exactly one isomorphism G: M, > M, 
taking 5, to by. Conversely, if such a G exists, then F,,(1,(M,,6,) = 
F4(72(M 2, b2). 


Proof Given pe M, we define G(j) as follows: Let 7, be a path such 
that f,(0) = 6 and f,(1) = p. Then the path f = F, o f, on M has a unique 
lifting to a path f, on M, covering f and with initial point f,(0) = 6,. We 
define G(p) = f,(1). Of course we must show that the definition is indepen- 
dent of the path f, chosen, and that G is continuous. On the other hand, once 
these facts are proved, then we see that F, = F,°G and that G(b,) = b, 
are immediate consequences of the definition. Uniqueness of G will follow as 
well since any G with the properties required in the theorem must take J, toa 
path f, on M, which covers f = F, o f, and runs from 6, to G(f). This 
applies to loops as well, which proves the converse. 

Now suppose that f, and g, are distinct paths on M, from b, to p. Let 
f=F,of, and g = F, ° , and consider the loop f» g7! with g™!(s) = 
gi—s), O<s<1. This loop determines an element [f*g~'] of 
F 14 7,(M,, 51) and hence also the (same) element of F3, 7, (M3, 53). In view 
of the preceding corollary, if we lift this to a path from b,, its terminal point 
will necessarily be 6,; and so the lifted paths f, and g, on M, beginning at b, 
both end at the same point, that is, f.(1) = g2(1). It follows that by using 
either f; or g, we obtain the same value for G(p). We also see from this line of 
argument that there is a one-to-one correspondence between points of M,, 
i = 1,2, and equivalence classes of paths f on M issuing from b relative to the 
subgroup A = F,(2,(M,, b) < 2,(M, b), where equivalence (x) relative to A 
is defined for paths f,g with f(0) = b = g(0) by f ~g if f(1) = g(1) and 
[f +g ‘Je A. In fact, let pe M, {f} an equivalence class of paths from b to p; 
{f} determines a point P of M, which lies over p. Indeed, the class 
{f} lifts to a class {f}, all curves of which issue from the point 6,; and we 
have just seen that they all have as terminal point ð. Conversely, when 
F: M + M is a covering, fixing b, be F~1(b) defines a correspondence of 
points ĝe M and equivalence classes as follows: Let f be a path from 6 to ĵ; 
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then p<>{F o J}. The classes of loops at b are then in one-to-one correspon- 
dence with the points over b. These remarks should clarify the intuitive 
situation and are useful later. The reader might find it helpful to think of the 
example M = R*, M = T°. 

To continue the proof, it is clear that G is onto and has an inverse G~? 
which is described in a completely symmetrical way to G. Now let 
Ďa = G(Ď,)€ M, and let V, be an admissible coordinate neighborhood of 
p= F(b) on M, i=1,2. We also suppose p(V) = Bi(0) < R” and 
W(p) = 0. If fis a path from b to pon M which lifts to paths f; joining b; to p; 
on M,, i = 1, 2, then we see that this path may be used in the definition of G 
as described above. For any point q in V we have a radial path (in the local 
coordinates), say g,, from p to q and f; = f * g, lifts to paths from Í; to q; in 
the component U, of F; '(V) containing b;, i = 1, 2; thus G(g,) = ĝa. This 
description being unique, and valid for every q € V, we see that G: 0,0, 
is one- -to- sone and onto and in fact may be described as follows: G | U,= = 


(F, | Us) (F| U,). Thus G | U, is a diffeomorphism and since M, is 
covered w open sets = this type, G is differentiable, which completes the 
proof. i 


(9.6) Corollary If F: M —> M is a covering and M is simply condocied, then 
the covering transformations are simply transitive on each set F~! (p). If we fix 
be M and be M with F(b) = b, then these choices determine a natural isomor- 
phism ®: nı(M, b) —>T of the fundamental group of M onto the group of 
covering transformations. 


Proof Suppose that q;,q.¢F ‘(p). We apply Theorem 9.5 with 
M, = M, M, = M to obtain a covering transformation G: M > M such 
that G(q,) = q2. To see that the hypotheses are satisfied it is only necessary 
to note that one M is simply connected, m,(M, q;) = {1}, i = 1, 2; hence 
F, (7, (M, q1)) = {1} = F, (71(M, q2)). We remark that by Theorem TI1.9.3, 
it now follows that group I’ of covering transformations must therefore 
be simply transitive on F7 ‘(p ) for each pe M. 

Having fixed be M and be x '(b), we may establish an isomorphism of 
zı(M, b) and Ff as follows. Let [g] € z,(M, b) and @ the lift of g e [g] to M 
determined by g(0) = 6. We have seen earlier that any two curves g,, ĝa which 
are lifts of homotopic curves are homotopic. In particular two loops of [g] 
with g,(0) = 6 = g,(0), must have the same terminal point b, and must be 
homotopic (with endpoints fixed). Since g is a loop, F(b) = b = F(b,). We 
let ®[g]ef be the covering transformation taking 6 to b, = 9(1). This 
defines ®: z, (M, b) +T. It is easily checked that ® is a homomorphism 
using the arguments of the preceding theorem. If ®[{g]= 1, then 
g(0) = 6 = G(1) so that g determines an element of 2,(M, b). Since this 
group contains only the identity, we have g ~ e; by a homotopy A. Then 
H = F-H is a homotopy of g to e, so [g] = 1 and hence ® is one-to-one. 
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We also see that ® is onto: if G, e Ñ, then let b, = G, (b). There is a path g 
from 6 to 6, and since F(b) = F[G,(6)] by definition of covering transforma- 
tion, g= Fg is a loop at b. It determines [g]e€72,(M, b); and since the 
covering transformation G = ®([f]) agrees with G, on b, G,(6) = b, = 
G(b), we must have G = G, by the results of Chapter HI (or by Lemma 9.1). 


(9.7) Theorem Let M be a connected manifold and b a fixed point of M. 
Then corresponding to each subgroup H c n,(M, b) there is a covering 
F: M > M such that for some b e F~'(b) we have F, n,(M, b) = H. F and M 
are unique to within isomorphism. 


Proof The uniqueness is just the previous theorem, and the proof of 
that theorem also indicates how the space must be constructed. The points 
of M will consist of equivalence classes of paths from b, two such paths f, g 
being equivalent if and only if f (1) = g(1) and [f* g7']e H, g7’ denoting 
the path g` !(s) = g(1 — s), O < s < 1. It follows from the fact that H is a 
subgroup that this is an equivalence; we denote it by f = g and denote by (f} 
the equivalence class of f (or point of M). The projection map F: M > Mis 
defined by F({ f}) = f(1) for any fe { f}. Given any { f}e M, let p = f(1) and 
V, y be a coordinate neighborhood of p on M with y(p) = 0 and W(V) = 
B5 (0), the open n-ball. For each qe V there is a unique path g, from p to q 
corresponding to a radial line in y(V). Then q > {f* g,} defines a map 
Oy: V + M with F ° 0,(q) = F{ f * g} =f *g(1) = q for all q in V. Suppose 
his a path from b to p also and that h x f, that is, [ho f~']¢H. Then it is 
easy to see that 0,(V) ^ 0,(V) = Ø. Indeed, if for some qe V, we have 
{f * ga} = {h * gq}; this would require [f= g * (h*g,')] =[f* h™'] to be 
an element of H, contrary to assumption. We leave it to the reader to check 
that the sets 0,(V) with coordinate maps y ° F define a manifold’ structure 
on M which makes F: M >M a covering with {V, p} as admissible 
neighborhoods. 

Finally, we must establish that F »(7(M, b)) = H, where b = {e,}, the 
point of M determined by the constant path at b. Suppose that f(r), 
0<t< 1,isa loop at b with [f] € H. Then f(0) = f(1) = b and we define a 
one-parameter family f, of paths from b by f(s) = f(st), 0< s,t < 1. a 
F(t) = {A (s)}. Then f(t), 0 < t < 1, is a path on M with F(f(t)) = f(1) 

f(t), hence f covers fand is a loop at b. It is straightforward to check that this 
actually determines an isomorphism F, of ,(M, 6) onto H; we may apply 
methods similar to those already used above. This completes the proof. Jj 


If we take H = {1} we have a very important corollary. 

t It is not easy to show that M has a countable basis of open sets. A direct proof may be 
avoided; lift a Riemannian metric from M to M (Exercise 1) and use Kobayashi and Nomizu 
[1, Vol. I, App. 2]. 
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(9.8) Corollary Every connected manifold M has a simply connected cover- 
ing —called its inverse covering—which is unique to within isomorphism. Choice 
of be F~*(b) for be M determines an isomorphism of 7,(M,b) onto Ÿ the 
group of covering transformations. Then M/T is diffeomorphic to M, that is, M 
is the orbit space of its fundamental group acting properly discontinuously on its 
universal covering M. 

Exercises 

1. Show that if F: M >M isa covering and M is Riemannian, there is a 
unique Riemannian metric on M such that F is a local isometry. 

2. Suppose that the assumptions of Exercise 1 are satisfied and that M is 
compact. Determine the relation of the volumes of M and M in terms of 
their fundamental groups. 

3. Determine the meaning, in terms of F: M > M (a covering), of 
F, (7, (M, b)) being a normal subgroup of z, (M, F(b h)). 

4. The universal covering G of a connected Lie group G has a Lie group 
structure such that the covering map F: G > G is a homomorphism. 


Notes 


Integration on manifolds has two very important applications both of which are in- 
troduced briefly in this chapter. First, integration on Lie groups with respect to an invariant or 
bi-invariant volume element has been crucial in many areas of research on Lie groups and their 
homogeneous spaces. Weyl used it to prove the complete reducibility of representations of 
semisimple Lie groups, a central fact of representation theory (see, for example, Weyl [2]). We 
have seen how this was done for the compact case; the generalization to noncompact. semi- 
simple groups was accomplished by Weyl’s “ unitary trick.” But the integral is also used exten- 
sively to study function spaces on Lie groups and to prove such basic theorems as the 
Peter-Weyl theorem. The reader wishing to go further into these ideas should read relevant 
portions of Chevalley [1] and look at the last chapter of Helgason [1]. Many ideas used in the 
study of Lie groups can be exploited in studying the spaces on which they act, especially 
homogeneous spaces—which are essentially coset spaces of Lie groups. In this case, too, 
the invariant volume element and related theory of integration is crucial in many problems of 
current interest in analysis. Again the reader is referred to Helgason [1] for samples of these 
applications. 

The other important application of integration on manifolds is to algebraic topology via 
de Rham’s theore and Hodge's theorem (see Warner [1]). This approach to topology was 
particularly useful in the study of the topology of Lie groups and homogeneous spaces, for 
which purpose de Rham’s theorem was presumably first conjectured. A survey article by 
Samelson [1] should give the reader some idea of just how crucial this was in the early theory. A 
well-known theorem, the Gauss-Bonnet theorem (O'Neill [1]; Stoker {1]), is a beautiful 
example of how integration may be used to obtain relations between the topology of a manifold 
and some of its local geometric invariants: in this case the curvature. A recent treatise by Greub 
et al. [1] gives a comprehensive treatment of the many relations of differential geometry to 
algebraic topology. including. of course, the generalized Gauss-Bonnet theorem. 

Both Milnor [2] and Guillemin and Pollack [1] have many elegant applications of 
differentiable methods to algebraic topology which are within the grasp of the reader at this 
point: determination of H*(S"), proof of the fundamental theorem of algebra by use of the notion 
of index of a vector field, and the Poincaré-Hopf theorem on the sum of indices of singularities of 
a vector field. 
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We begin this chapter by showing very briefly how differential calculus can be applied to 
study the geometry of curves in Euclidean space E” (or R"), especially plane curves (n = 2) and 
space curves (n = 3). The geometric concepts which are discussed—arclength. curvature, and 
torsion—will be familiar to many readers as will the basic tool used, namely differentiation of a 
vector field along a curve. In the second section differentiation of vector fields along curves is 
used again to define and study differentiation of vector fields on a special class of Riemannian 
manifolds—those which are imbedded (or immersed) in £” and carry the induced Riemannian 
metric. These same ideas can be used to investigate the geometry of surfaces in E*, which is the 
subject matter of much of classical differential geometry. However, our main objective is to use 
this situation as a model in order to define differentiation of vector fields on an arbitrary 
Riemannian manifold M. This is done in Section 3 where the Riemannian connection V is 
defined and its existence and uniqueness (depending only on the Riemannian metric) is 
demonstrated. We define in essence, a sort of directional derivative of vector fields Y on M, 
Vx, Y giving the rate of change of Y at pe M in the direction of X,. It generalizes X ,f the 
derivative of a function, which was defined at the beginning of Chapter IV. As might be 
expected, it is more complicated than differentiation in Euclidean space, where we can take 
advantage of the natural parallelism. Conversely, it can itself be used to introduce a more 
restricted type of parallelism on arbitrary Riemannian manifolds. Once the basic properties of 
Vx Y are established, we are ready to apply differential calculus to the study of Riemannian 
manifolds. 

We very briefly and formally introduce the Riemannian curvature tensor—an important 
geometric object which we study in the next chapter. The remainder of this chapter is spent on 
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the study of geodesics, which generalize to Riemannian manifolds the straight lines of Euclidean 
geometry. They have important similarities to straight lines as well as important differences. 
For example, like straight lines of Euclidean space, the unit tangent vector is constant (has 
derivative zero) as we move along the curve. Although geodesics in general are not the curves of 
shortest length between any two of their points, they will have this property for nearby points, 
and so on. Only the most basic properties can be proved in such a brief treatment, but enough is 
established to reveal the interesting variety of phenomena which occur for geodesics in general 
manifolds. 

In the final section some important examples are considered, namely the Riemannian 
symmetric spaces, which have even more similarities with Euclidean space than the usual 
Riemannian manifold. In particular, the space of non-Euclidean geometry is a symmetric space. 
The examples considered here also have important curvature properties which are discussed (in 
part) in Chapter VIII. 

The presentation of Sections 7 and 8 was very much influenced by Milnor [1], especially his 
Sections 10, 20, and 21. 


1 Differentiation of Vector Fields along Curves in R” 


In order to clarify the definition of differentiation given later for general 
Riemannian manifolds, we shall first consider some special situations in the 
oriented Riemannian manifold R” (with the standard orientation and inner 
product). In doing so we will make full use of the natural parallelism in R", 
that is, the natural identification of the tangent spaces at distinct points. 

Let C be a curve in R" given by x(t) = (x'(t),..., x"(t)) witha < t < b. 
We suppose that Z(t) = Zx is a vector field defined along C; thus to each 
te (a, b) is assigned a vector (Fig. VII.1): 


z(=}, aof) £ Tea (R”). 


Figure VII.1 


We will suppose Z to be of class Ct, at least, which means that the compon- 
ents a(t) are continuously differentiable functions of t on the interval (a, b). 
The velocity vector of the (pàrametrized) curve itself is an example—in this 
case a'(t) = x(t). 
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Our purpose is to define a derivative or rate of change of Z(t) with 
respect to t; it will be denoted Z(t) or dZ/dt and will again be a vector field 
along the curve. Of course, in general, neither Z(t) nor its derivative need be 
tangent to the curve. Now since in R” we have a natural parallelism (or 
natural isomorphism) of 7,(R") and T,(R") for any distinct p, q e R", we are 
able to give meaning to Z(ty + At) — Z(to), the difference of a vector in 
Tyio+ay(R") and a vector in Tyy(R"). For definiteness we suppose 
Z(t) + At) moved to or identified with the corresponding vector in Taal 8") 
and that the subtraction is performed there (Fig. VII.2). This allows us to 
define the differential quotient 


1 Z a'(to + At) — a'(to)( ô 
a [Alto + At) — Z(to)] = x At (=). 


L 


Z(fot+ AS) 


Z (ty) 


X(fy+B?) 


Zlh+tA N-Z) 


x (h) 
Figure VII.2 

The equality is due to the fact that if we write vectors in terms of the basis 

/ðx", ..., 0/ðx" which is a field of parallel frames on R”, then vectors at 


distinct points, say Z(t) + At) and Z(to), are parallel ifand only if they have 
the same components. Passing to the limit as At > 0 gives the definition 
; dZ 3 é 
1.1 Z(to) = |—-] = ælt — Tao (R”). 
aD Zo) = (T) = Laleol a) E Teo 


(1.2) Remark A consequence of this formula is that if we introduce a new 
parameter on the curve, say s, by t = f(s) with to = f (So), then 


dZ\ _ (dt\ (dZ\ | 
ds},  \ds},.\ dt Jr 
(dt/ds),, is a scalar; the other terms are vectors. 
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As a simple example consider the curve x(t) = (cos t, sin t), a unit circle 
in R?. Suppose Z(t) = —sin t(@/0x) + cos t(d/éy); this is the velocity vector 
of the point which traces out the circle. Then dZ/dt = —cos t(d/0x) — 
sin t(@/dy) is a vector at x(t) = (cos t, sin t) which has constant length +1 
and points toward the origin (Fig. VII.3). 


Z(t) 


Figure VII.3 


(1.3) Definition A vector field Z(t) is constant or parallel along the curve 
x(t) if and only if dZ/dt = 0 for all t. 


Suppose that Z,(t) and Z(t) are vector fields of the above type defined 
along the same curve C and that f(t) is a differentiable function of t on 
a<t<b.Thenf(t)Z(t) and Z,(t) + Z,(t) are vector fields along C and we 
have the following easy consequences of formula (1.1). 


(14a) SZO + Zale) = St + 2, 
(14) t (riz) = F200) +110, 
(1c) 4 (Z,(0, Zale) = (SG Zale) + (20.3) 


where (Z,, Z2) is the standard inner product in R”. 

The formula we have given for dZ/dt is in terms of the components of 
Z(t) relative to the natural field of frames 6/éx', ..., 0/6x" in R", which are 
constant along x(t). However, we sometimes find it convenient to use some 
other field of frames, say F,(t),..., F,(t), defined and of class C’ at least 
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along x(t). Then Z(t) has a unique expression as a linear combination of 
these vectors at each x(t): 


Z(t) = b*(t)F (t) + +++ + b'(t)F,,(t). 

Differentiating this expression we obtain, with the aid of (1.4a,b), 
dZ & (db „dF; 
— = — F(t) + b/(t)—2]. 
d (T PEP i) 
However, since dF ,/dt are vectors along x(t), they too are linear combina- 
tions of F,(t), 

dF, 2 

TI Y AOFM) 


k=1 


This gives the formula 


as = 3 (T ay Hodo) Fale 


This includes (1.1) as a special case since a¥(t)=0 when the frames 
F,(t),..., F,(t) are parallel. 

Although we have used a particular coordinate system, the natural one in 
R”, in fact 

dZ l 

— = lim — [Z(t At) — Z(t 
Fg ere) 20) 
is an object which depends only on the geometry of the space; it is indepen- 
dent of coordinates, so it is defined equally well for parametrized curves 
in E”. 


The Geometry of Space Curves 


As an illustration of these ideas we derive the Frenet—Serret formulas for 
a curve of class C? in R?. We first note that length of the curve from a fixed 
point Xo = x(to) is given by s= fi, ((x(t), x(t))' dt so that ds/dt = 
(x(t), X(t))'/?. If s is used as parameter, then ds/dt = ds/ds = 1 so that X(s) is 
a unit vector tangent to the curve. We let T(s) = X(s) denote this unit tangent 
vector. Because arclength, the parameter s (to within an additive constant), 
and T(s) are determined by the (induced) Riemannian metric on x(s), not by 
the particular rectangular Cartesian coordinates or origin used, they and the 
derivatives of T(s) are geometric invariants of the curve, that is, they will be 
the same at corresponding points for congruent curves. Differentiating the 
identity (T(s), T(s)) = 1 and using (1.4c), we obtain 2(T(s), dT/ds) = 0. 
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Therefore dT/ds is zero, or is a nonzero vector orthogonal to T(s) at each 
point of the curve. We define the curvature k(s) by k(s) = |\d T /ds||; and when 
k(s) + 0, we let N(s) be the unique unit vector defined by dT/ds = k(s)N(s). 
When k(s) # 0, let B(s) be the uniquely determined unit vector such that 


T(s), N(s),,B(s) define an orthonormal frame with the orientation of 0/0x’, 
ĝ/ðx?, 0/ðx? (see Fig. VII.4). This determines a field of orthonormal frames 


Figure VII.4 


i! 


along the curve, defined whenever k(s)+ 0, which we shall henceforth 
assume is the case at all points of the curve under consideration. This 
assumption is justified since it is the generic or typical situation for a space 
curve. In fact, we have the following fact concerning a curve for which k(s) 
vanishes identically. 


(1.6) Theorem If k(s)=0 on the interval of definition, then x(s) is a 
straight line segment on that interval, and conversely, for a straight line, 
k(s) = 0. 

Proof If the curve is a straight line, then it is given in terms of arclength 
by x'(s) = a! + b's, i = 1, 2, 3, where J=; (b*)? = 1. Thus 

T =} ju b' ôx 

and dT /ds = 0. Conversely, if k(s) = 0, then dT/ds = 0. Since T = $ dx'/ds 
a/ax', s being arclength, this implies d’x'/ds? =0, i=1, 2, 3. Thus 
xi(s) = a + b's, i = 1,2, 3, with a and b’ constants, and the curve is a 
straight line. Note that T(s) and k(s) are defined for a curve in R”, any n (not 
just n = 3), and the proposition just proved is still valid. 


Now, returning to the study of a curve in three-space, for convenience of 
notation we let F,(s), F(s), F3(s) denote T(s), N(s), B(s), respectively. Since 
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this is a field of orthonormal frames we have (F;(s), F ,(s)) = 6,;. Differentia- 
tion of these equations gives the relations 


dF; dF\ _ ae 
(T Fi) + (Fu. zh) = 0, i,j = 1,2, 3. 


As we pointed out in the derivation, dF /ds must be a linear combination of 
the F,(s) for every s, so we may write 


dF, i 
Fa E 2 ajFis), j=1,2,3. 
This combines with the previous equations to give 


(z aF, , F) + (F; Zar) =0 
k k 
or 

al(s)+ai(s)=0, 1<ij<3. 
This means that the matrix (a‘(s)) is skew-symmetric. By definition dT /ds = 
k(s)N, that is, a?(s) = k(s), and so a}(s) = 0 = ai(s). Let a3(s) = t(s) as a 
matter of notation. Then rewriting in terms of T, N, B, we have the Frenet- 
Serret formulas: 


(1.7) — = k(s)N, 


dB 

ds ` 
expressing the derivatives with respect to s of T, N, and B which are called 
the tangent, normal, and binormal vectors, respectively, of x(s), in terms of 
these vectors themselves. Formula (1.7) defines the functions k(s) and t(s) 
along the curve. 


—1(s)N, 


(1.8) Definition k(s)is called the curvature and 1(s) the torsion of the curve 
C at x(s). 


The curvature measures deviation of C from a straight line and the 
torsion measures “twisting” or deviation of C from being a plane curve. Of 
course, T, N, and B as well as curvature and torsion are independent of the 
coordinates used in the Euclidean space containing C. 


(1.9) Theorem A curve in E? lies in a plane if and only if t(s) = 0. 


Proof If the curve lies in a plane, then from the definition of T(s) and 
dT /ds we see that these vectors lie in the plane of the curve for each point 
x(s) of the curve. Thus the unit vector B(s) has a fixed direction, orthogonal 
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to the plane, and so is always parallel to a fixed unit vector orthogonal to the 
plane. Therefore dB/ds = 0 and t(s) = 0 

Suppose that t(s) = 0. Then dB/ds = 0 and B is a constant vector along 
the curve. We choose the coordinate axes so that the curve passes through 
the origin 0 at s = 0 and so that B(s) is parallel to 6/éx3, the unit vector in 
the direction of the x°-axis. Then x(s) = (x'(s), x7(s), x°(s)) determines the 
vector x(s) from the origin 0 to the point x(s) on the curve. Differentiating 
(x(s), B(s)), we have 


© l) BE) = (Ths) B6) + (x09) T) = (TO BO) = 0 


so that (x(s), B(s)) is constant. Since x(so) = 0, that is, x(50) = = 0, the vector 
x(s) [or line Ox(s )] is always perpendicular to B = 6/6x>. Thus the curve lies 
in the x'x?-plane. | 


The advantage of using arclength s as parameter on the curve C and the 
frames T(s), N(s), B(s) is that they all have intrinsic geometric meaning, 
depending as they do only on the Riemannian structure of the ambient space 
E? and the nature of the curve itself and not on any coordinates that we 
might use in Æ’. Although the formulas for the derivative of a vector field g 
along the curve are more complicated, since they involve the second terms ak 
in (1.5) which vanish when we use parallel frames along C, nevertheless the 
advantage oi being geometrically determined is quite crucial. Even the 
coefficients aj, which here are £K(s ) and +1(s) or zero, have geometric 
meaning as we have seen. In fact it is not difficult to show (see, for example, 

O’Neill [1]) that k(s) and t(s) determine C up to congruence. 

As an illustration of the utility of the Frenet frames T, N, B we consider 
briefly the dynamics of a moving particle in space whose position p(t) is 
given as a function of time t. x s(t) be the length of path traversed from 
time t=0 to time t, = fo ((dp/dt, dp/dt))'* dt. Then ds/dt = 
((dp/dt, dp/dt))'?. = \dp/d is is oe a with which the particle moves along 
the curve, while its velocity vector is given by 


_ dp dpds _ ds 
W= ed de 


where T is the unit tangent vector. Differentiating we obtain the acceleration 


ap dT a (a) T d?s 


th= = —. 
a dt? dt dt? 
Since dT /ds = kN this becomes 


d*s ds\? 
(110) a(t) = 437 + (T) N. 
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Thus the acceleration decomposes into the sum of two vectors, one in the 
direction of the curve, whose magnitude is the time rate of change of the 
speed d?s/dt?, and the other normal to the curve and directly proportional to 
both the square of the speed and to the curvature, this latter depending only 
on the curve. If the motion is a straight line motion, then k = 0 so that a has 
the direction of the line. If the particle moves at constant speed, so that 
d°s/dt? = 0, then the acceleration depends only on the shape of the path. 
The same remarks also apply to the force F acting on the particle, which by 
Newton’s second law, F = ma, is proportional to a with the mass m as 
constant of proportionality. 


Curvature of Plane Curves 


Some special comment is required for the case of a curve C lying on an 
oriented plane. Let s — (x(s), y(s)) define the curve, parametrized by 
arclength. Then T = (s) d/€x + j(s) 6/@y is the unit tangent vector. If 
dT/ds + 0, then we may as before define k(s) = ||\dT/ds||, that is, we may 
consider the curve as a space curve (x(s), y(s), 0) whose z-coordinate 
z(s) = 0, and use the same definitions. However, for plane curves a more 
refined definition of curvature is possible: At each point of C choose N so 
that T, N have the same orientation as 4/éx, 0/éy (this uniquely determines 
T, N). (See Fig. VII.5.) Then define the curvature k(s) so that k(s)N = dT jds. 
This allows k(s) to be negative, zero, or positive. The curvature thus defined 
for a plane curve has the previously defined curvature of C (considered as a 
space curve) as its absolute value, k(s) = | k(s)|. To carry our interpretation 
somewhat further let O(s) be the angle of T with the positive x-axis 
(Fig. VII.S). Then 


ô : ô 
T{s) = cos 8(s) ax + sin A , 
and 
dT sea ô 6 
ae —6(s) sin Os) = + @(s) cos io Fe 
AN =dT/ds 
a 


Figure VII5 
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The unit vector N(s) chosen so that T(s), N(s) is an oriented orthonormal 
basis is N(s) = —sin 0(2/€x) + cos @(2/éy), since the determinant of the 
coefficients of T, N as combinations of @/éx, ĉ/ĉy is 


Jä cos sin aa] 
—sin@ cosé}) j 


Thus Å(s) = 0(s) or d0/ds, the rate of turning of the tangent vector T with 
respect to arclength. Moving along C in the direction of increasing s the 
curvature is positive when the tangent is turning counterclockwise and nega- 
tive otherwise. Its sign depends on the sense of the curve (direction of in- 
creasing s) and the orientation of the plane, but not on the coordinates. 

Suppose C is a circle of radius r. Then s > (r cos(s/r), r sin(s/r)) gives the 
curve parametrized by arclength: 


Since 


is the unique unit vector such that T, N has the orientation of ĉ/ĉx, ¢/@y, we 
see that k(s) = 1/r. Thus the curvature is a constant. If, as we have assumed 
by our parametrization, the circle is traversed in the counterclockwise sense, 
it is a positive number; in any case its magnitude is inversely proportional to 
the radius. 

Returning momentarily to the dynamics of a moving particle we see that 
if a particle moves on a circle in such a way that its speed is constant vo, then 
the force F acting on the particle is 


F=ma= —N. 
Since N is the unit normal vector, F is directed toward the center of the 


circle and its magnitude is mvg/r which gives the usual formula for the 
centripetal force necessary to keep the particle in a circular orbit. 


Exercises 
1. Prove that a curve in R? for which t(s) = 0 and k(s) is a constant, k # 0, 


is a circle. 
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2. A helix is defined by parametric equations of the form x! = a cos t, 
x? = asin t and x? = bt, where a, b are positive constants. Determine k 
and q for a helix. [Hint: First change to arclength as parameter. ] 

3. Let Z, and Z, be two vector fields along a curve in R? and let Z, x Z, 
be their cross product (as defined in a three-dimensional vector space). 
Show that 

LG, x z) = x Z,+ Z, x 2, 

4. Show that the plane which closest approximates a curve C at p is 
spanned by T and N. This plane is the limiting position of the plane 
through p’, p, p” on C as p’, p” approach p. 

5. Using the technique of Exercise 4, find the center of the best approximat- 
ing circle and of the best approximating sphere to each point p of C. 
[They should be located relative to the moving coordinate frame 
T, N, B.] 

6. Show that if the plane of T and N goes through some fixed point 0 of E? 
for every point on C, then C lies in a plane. 


2 Differentiation of Vector Fields on Submanifolds of R” 


In the previous section we studied differentiation of vector fields along 
curves, which includes, of course, one-dimensional submanifolds of 
Euclidean space. In this section we do the same for vector fields “along” 
other submanifolds M c R”, for example a surface in R?. This is somewhat 
more complicated and certainly not the most direct way of approaching the 
subject of differentiation on manifolds; but it should help our geometric 
understanding. Just as in the case of a curve, we are concerned with a vector 
field Z defined at each point of M but not necessarily tangent to M (see 
Fig. VII.6), that is, to each pe M, we assign Z, € T,(R"). When Z is such that 


Figure VII.6 
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Z, is tangent to M, Z,€ T,(M) = T,(R"), then we shall say that Z is a vector 
field on M or a tangent vector field. Only in this case has Z meaning for M as 
an abstract manifold, independent of any imbedding or immersiont in R". In 
any case differentiability of Z may be given meaning since its components 
relative to the canonical frames of R” at points of M will be functions on M, 
Z, = Y2-, a*(p)(é/6x*),, and by definition we shall say that Z is of class C' 
if a*(p), æ = 1,...,n, are of class C” on M. In particular, the vector fields 
€/éx', ..., 6/6x" of R", restricted to M, are C”-vector fields along M (but 
rarely on M). 

If pe M, then T,(R") and its subspace T,(M) carry the standard inner 
product of R" so M has the induced Riemannian metric. This allows us to 
decompose any vector Z,, p€ M, in a unique way into Z, = Z, + Z, with 
Z€ T,(M) and Ze T (M), the orthogonal complement of 7,(M). This 
reflects the direct sum decomposition of 7,(R”) into mutually orthogonal 
subspaces: T,(R") = T,(M)@®7T;(M) called the tangent space and the 
normal space to M at p. Let 2’, n” denote the corresponding projections: 
n(Z,) = Z, and x"(Z,) = Z}; they are linear mappings of T,(R") onto the 
subspaces tangent and normal to M. Figures VII.7 and VII.8 illustrate this 
decomposition for a curve and surface in R°. 


Figure VII.7 


Suppose that Z is a vector field along M of class C’. Then n'(Z) and n”(Z) 
are also vector fields, which are tangent and normal to M, provided that they 
are differentiable. We leave the proof of the following lemma (including the 
assertion of C’ differentiability) to the exercises. 


(2.1) Lemma With Z as above n(Z) and n"(Z ) define mutually orthogonal 
C’-vector fields Z', Z" along M such that Z = Z' + Z”, that is, at each pe M, 


+ Since we consider only local questions in this section, we may restrict ourselves to 
imbedded (regular) submanifolds by Theorem HI.4.12. 
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Figure VII.8 


Z',€ T,(M) and (Z',, Z) = 0. If f is a function of class C" on M, then n'( fZ) = 
fn'(Z) and n"( fZ) = fr'(Z). Further, given two such vector fields Z,, Z, , then 
n(Z, + Z2) =n (Z,) + 2 (Zz) and n"(Z, + Z2) = n"(Z,) + 2"(Z>). 


As examples we note that a vector field Z along a curve decomposes 
uniquely into the sum of a tangent vector field and a vector field in the 
normal plane: 2(Z)=(Z,T)T and 2"(Z) = (Z,N)N + (Z, B)B (see 
Fig. VII.7). For the case of an arbitrary C” imbedded manifold M, we see 
that 2’(0/0x*) applied at each pe M gives a C® tangent vector field to M for 
each & = l,..., n. 

Now let Y be a tangent vector field to M c R”, that is, for each 
pe M, Y,€ T,(M), or equivalently n'(Y) = Y. If p(t) is a curve on M of class 
C’ or higher, defined for some interval of values of t, then Y(t) = Y, is a 
vector field along the curve. As such we can ignore M and differentiate Y(t) 
as a vector field along a curve in R” obtaining dY/dt, another vector field 
along the curve. In general, of course, dY/dt will not be tangent to M; 
however, at each point p(t) we may project dY/dt to a tangent vector 
n (dY /dt). 


(2.2) Definition The projection n'(dY /dt) will be denoted DY /dt and will 
be called the covariant derivative of the tangent vector field Y on M along the 
curve p(t) (see Fig. VII.9). 


Both Y and DY/dt are tangent vector fields, and thus have meaning for 
the abstract manifold M. However, the process by which DY /dt is obtained 
from Y and p(t) makes use of the imbedding of M in R". Our ultimate aim is 
to obtain for an abstract Riemannian manifold M a definition of derivative 
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Yair) 
z 
at 


Figure VII.9 


of a vector field along a curve which does not make use of an imbedding but 
is intrinsic to the Riemannian and differentiable structure of M itself. In the 
meantime we shall study in more detail the properties of DY /dr and will see 
that it has properties similar to the derivative discussed in Section 1. 

It is important to note that Y (t) need not be the restriction to a curve p(t) 
of a vector field Y on M—as above—in order for DY/dt to be defined. We 
need only suppose that Y(t) is a vector field along p(t), so defined that it is 
always tangent to M, that is, such that Y(t)e T,)(M). Then, as above, 
DY Jdt = n'(dY /dt), where dY/dt is the derivative of the vector field along a 
curve as defined in the previous section. Now suppose as in (1.4a-c) that we 
have vector fields Y,(t) and Y,(t) along p(t) on M and tangent to M. Then we 
have corresponding properties. 


(2.3) Theorem With Y(t), Y,(t), Y2(t) as above and f(t) a C’ function of t 
we have 


(2.3a) 5 (Mi + Y,)= 2a + a 

d 
(2.3b) =,( FOYE) = = Y(t) + fy, 
(2.3¢) 5 (Ye Y) = Ci r) aa (¥.77) 


The last equation concerns the induced Riemannian metric on M, that is, 
the inner product on T,(M), at each pe M, induced by the inner product in 
T,(R"). These properties are immediate consequences of the definitions, the 
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properties of x’, and of the corresponding statements of (1.4a—c). Applying z’ 
to both sides of (1.4a) and using linearity gives (2.3a). Similarly using (1.4), 
we see that (2.3b) holds: 


D d df df DY 

P m=i) Ars -gr 
The last property follows from (1.4c) if we remark that 

aY, (AYA AN BY (dX) iaa 

E al ml a eee 


and that n”(dY,/dt) is orthogonal to T,,)(M) so that 
DY, dY, DY, „[dYz DY, DY.) 
mans ni „Y. Y, —— 
(eha) a(r a} Vat Fan male Goan 
(2.4) Remark If we change to a new parameter, say s, by t = f(s), then 
DY /ds = (DY /dt)(dt/ds), dt/ds = f'(s) being a scalar. This also follows from 


applying n’ to the similar relation dY/ds = (dY/dt)(dt/ds) of the previous 
section. 


(2.5) Definition Given M c R" as above, let Yp be a vector field defined 
at each point of a curve p(t) on M and which at each point is tangent to M, 
that is, a vector field along p(t) tangent to M. Then we shall say that Y, is a 
constant or parallel vector field if DY /dt = 0. More generally if Y is a tangent 
vector field on all of M, then it is constant or parallel if it has this property 
along every curve on M. 


It is very important to note that DY/dt may be identically zero even 
though dY/dt is not, thus a vector field along a curve may be constant 
considered as a vector field on a submanifold M of R” even though it is not 
constant considered as a vector field along the same curve in R”. This obser- 
vation is a crucial point in some of our subsequent discussions, so we give a 
simple example. Let M = S', the unit circle in R?. Then t — (cos t, sin t) is 
the parametric representation and it may be considered as defining a curve 
on M. Let Y(t) be the unit tangent vector to this curve. As we have seen 
dY/dt is orthogonal to Y(t), that is, normal to M; hence DY/dt = 
n'(dY /dt) = 0, although dY/dt is never zero and in fact has constant length 
+1. Since any great circle on the unit sphere S”~' < R” is congruent to the 
great circle t > p(t) = (cos t, sin £, 0, ..., 0) on the intersection of S"~* and 
the 2-plane x? = --- = x" = 0 of R", we see that the unit tangent vector to 
any great circle arc p(t), parametrized by arclength, has the same property: 


DY _D{dp\ _ 4 
dt dt\dt} ` 


PDF pdfFactory Pro www.fineprint.cn 


312 VH DIFFERENTIATION ON RIEMANNIAN MANIFOLDS 


This is admittedly a special case; in general the derivative of a tangent 
vector field to M along a curve p(t) in M has both normal and tangential 
components different from zero. If a curve on M is such that (D/dr)- 
(dp/dt) = 0, that is, the (covariant)derivative of the unit tangent vector to the 
curve is zero along the curve, then we shall say the curve is a geodesic of M. 
We have just seen that the great circles on the unit sphere in R" are 
geodesics. In the case in which M is an open subset of R” or all of R”, then 
dY /dt = DY jdt, that is, in R” itself, as might be expected, covariant differen- 
tiation is just the usual differentiation. In this special case, according to 
Theorem 1.6, the only curves p(t) for which 


D (dp) _ d (ap 
dt\dt) — dt\dt 
vanishes identically are straight lines parametrized by arclength—or with t 
proportional to arclength. Thus geodesics on an imbedded manifold M are 
those curves which in some sense generalize the concept of straight line— 


even though they may not look “straight” when viewed from the ambient 
space R”. We shall study these questions in some detail later in this chapter. 


Formulas for Covariant Derivatives 


In order to further study the covariant derivative DY /dt of a tangent 
vector field Y on M along a curve we will need more detailed computations 
using local coordinates. Suppose dim M = mand that U, ọ is a local coor- 
dinate system on M with ọ(U) = W, an open subset of R". We denote the 
local coordinates by u', ..., u” and remark that p7: W > R" is an imbed- 
ding of W whose image is, of course, U—an open subset of M. We have 
previously referred to pg” ' as a parametrization of M. Let u = (u', ..., u™), 


then 
gy *(u) = (g (u), ..-, 9"(u)), ue W, 


gives y~* in terms of its coordinate mappings g*(u). [We let «, B, y, and so 
on, denote indices that range from 1 to n and i,j,k, and so on, indices 
ranging from 1 to m.] The coordinate frames will be denoted F}, ..., Fm; they 
span the tangent space to M at each point. Since this tangent space T,(M) at 
péM is a subspace of 7,(R"), these vectors are linear combinations of 
6/éx', ..., 0/Ox". In fact, generalizing earlier formulas for m = 2 and n = 3 
(Example IV.1.10) we have: 


ê " fg) ô 
=o- L| = ay £. 
(2.6) Fip a Px (l z (h ôx“ 


1 


Now suppose that p(t) is a curve on M of class C' and that Y(t) = Y, is 


a vector field along the curve which is always tangent to M. Then Y(t) may 
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be written as a linear combination of F,,...,F,,, so that Y(t) = 
Yi, OX(t)F,. Although 


is not tangent to M in general, projecting we obtain 


DY dY m {db dF 
gee (ell SF, + ben’ (“ 
«(| ae kt (a) 


dt izi 


or 
DY m  (db* DF 
2.7 — = — bk _*) 
u dt XG ae i 
We know, however, that DF;/dt, i = 1,...,m, are vectors tangent to M 
and may be expressed as linear combinations of Fy, ..., Fm. Suppose that 


the curve p(t) is given in local coordinates by e(p(t)) = (u(t), ..., u(t)), 

then in (2.6) the components are (composite) functions (297 /Cu') eine Of t 

through u'(t),..., v(t), and at each p(t) 

DF; (5) 2 & ĉg dw ( ĉ ) 
=7 


at Va) a 2 aad a Ne 
by the ordinary chain rule of differentiation applied to (2.6), and the proper- 
ties of x: The derivatives ¢?g*/éu/ Gu‘ are functions of u',..., u" and are 
evaluated at u(t) = (u'(¢),..., u™(t)) in this formula. 


We have previously remarked that when M is imbedded in R” by a C” 
imbedding—which we shall always assume—then ĉ/ĉx* restricted to M is a 
C* vector field along M. By Lemma 2.1, n'(ĉ/êx*) defines a C” tangent 
vector field on M, which must have then a unique expression of the form 
m(é/Cx*) = X k- ak(u)F, on U. The ak(u) are C* functions on M which we 
do not compute. Using them and the coordinate functions g*(u) of the par- 
ametrization @~' we define the C* functions I% (u) as 

24% 
= re, aer L<ij,k<m. 


tJ 
x 


Symmetry in i, j is a consequence of interchangeability of the order of differ- 
entiation. We do not explicitly compute these functions now, but we use 
them to write new formulas for DF,/dt: 


DF; = dui 


It -Fis i=1,....m. 


(2:8) dt |. 2 1 dt 


at each p = p(t), the T% being evaluated at (u'(r),..., u”(t)). A particular 
case, the curve given by u' = constant for i # j and w = t, gives the formula 
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for the covariant derivative of the vector field F; along the jth coordinate 
curve, conveniently denoted DF;/du/: 


This gives an interpretation of the meaning of T$ (u); it is the kth component 
(relative to the coordinate frames) of the covariant derivative of F; along 
that curve in which only the jth coordinate is allowed to vary, that is, along a 
coordinate curve. Using these formulas we may finally write (2.7) in the form 
in which we want it (after appropriate change of indices): 


m db‘ m 
(2.9) anaa + ¥ r (u(t)bi(s) oh. 

= i j=1 

This formula gives us the analog to formula (1.5) in that DY/dt is ex- 

pressed in terms of the field of frames Fy,..., F, on U c M, frames defined 
independently of either p(t) or Y. The companen of the covariant deriva- 
tive are the terms in brackets. The functions T$ (u) are defined over all of U 
and in (2.9) are evaluated at points of the curve. Indeed for every coordinate 
neighborhood on M we have frames F;, i = 1,...,m, and functions ri, 
which give DF,/du/. From these data DY /dt can then be computed according 
to (2.9) by ordinary differentiation of the components of Y and coordinates 


of p(t). 


Vx, Y and Differentiation of Vector Fields 


We will now change our point of view slightly in deriving some con- 
sequences of formula (2.9). Let Y be a tangent vector field on M which is 
defined everywhere—not just along some curve. On the coordinate neigh- 
borhood U we write Y = )7_, b*(u)F,. Let p be a point of U such that 
olp) = (ui, ..., um), and let X, be a tangent vector at p, X, = }, aF jp, a 


constant for j = 1,..., m. Now choose any differentiable curve p(t) what- 
soever with Anan = i A Silla = X,, so that in local coordinates it is 
defined by u(t) = (u' (t (t)) with u‘(to) = ub and (du‘/dt),, = a’. Then 


we may oe ova: ie - above with a surprising result. First we 
observe that Y(t) = ¥: b*(u(t))F, implies that 


db* m og : 
pie = — j— k 
| dt L p (re) 2 i l 


Taking this into consideration, (2.9) may be written in modified form as 


(2.10) (" = 


ih = (x pbk + ET (to)b (uo) Fi | 


PDF pdfFactory Pro www.fineprint.cn 


2 DIFFERENTIATION OF VECTOR FIELDS 315 


A careful examination of this formula discloses the remarkable fact that 
the right-hand side does not depend on p(t) but only on its tangent vector X, 
at p. Since (D¥/dt),, is a vector in T,(M), this formula defines a mapping of 
T,(M) to itself X, > (DY/dt),,. We introduce the notation Vy, Y for the 
image of X,,, that is, we define Vy, Y = (DY/dt),, along any curve p(t) with 
P(to) = p and (dp/dt),, = X,. We have defined previously a “directional 
derivative” X „f of a function f with respect to a vector X ,; what we have just 
now done is define in similar fashion a rate of change of the vector field Y at 
p in the direction X,,. It is measured by a vector Vy, Y. 

It is worth commenting that along the curve p(t) we have at each point 
Vapa Y = DY/dt as a consequence of our notation. We summarize the essen- 
tial properties of Vy, Y in a theorem. 


(2.11) Theorem Let M c R" be a submanifold. For any tangent vector field 
Y of class C’,r > 1, on M we have at each point pe M a linear mapping 
Xp > Vx, Y of T,(M) > T,(M). Then Vx, Y, being defined as above, has the 
following properties: 


(1) If X,Y are vector fields of class C" (of class C”) on M, then Vy Y 
defined by (Vx Y), = Vx, Y isa C’~! (respectively, C®) vector field on M. 

(2) The map T,{M) x ¥(M)— T,M) given by (X p, Y) > Vx, Y is R- 
linear in X „ and Y. For a function f, differentiable on a neighborhood of p, 


Vx ASV) = (X S)¥, + f(P)Vx, Y- 


(3) If X, Ye X(M), then [X, Y] = Vx Y — Vy X. 
(4) If Y, and Y, are vector fields and (Y,, Y,) their inner product, then 
XY, Ys) = (Vx, Yis Yop) + (Yip Vx, Y2). 


Proof Let Y = } b‘F, and X = Y a*F, in the notation just used. The 
b* are functions of the local coordinates (u', ..., u”) and so are the a* when X 
is a vector field. Since X „b* = )'"_ , (Cb*/Cu’)a’, the definition Vx, Y= DYjdt 
and (2.10) imply that 


wrer J (e +E rb) F.. 

k j 
From this formula, valid for each pe U, it is clear that properties (1) and (2) 
hold, whereas (4) is just the earlier property (2.3c) of DY /dt. [Again note that 
Xf = dfidt, the derivative of f(p(t)), when we assume X, = dp/dt; in parti- 
cular this holds for f = (Y,, Y).] Only property (3) requires more careful 
verification. We will verify (3) by direct computation in a coordinate neigh- 
borhood U, ¢ using our previous notation. With X and Y given on U as 
above we compute [X, Y]: 


E (Ma Mo, 
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Using the formula for (DY/dt),, we compute Vy, Y — Vy, X. We have 


vz, Y -V X =F (HM a-y ÎE b)+ Freie! — abby be 
— = r = ij = ` 
Xp Yp = bul oui a Oa k 
However, since I‘, = i; this reduces to the first term in parentheses, the 
second sum being zero. Thus (3) follows. | 


We conclude with several remarks. First, a careful reexamination of what 
we have done will show that Vy, Y depends for its definition only on the 
Euclidean structure of R”, that is, on E” and on the imbedding of M in E”. It 
is independent of local coordinates, although we use them in its definition 
and in the proof above. However, dY/dt and DY/dt = n'(dY/dt) are geomet- 
ric in nature and so is Vx, Y. 

Secondly, if Vy, Y is axiomatized and defined first, then DY /dt could be 
introduced by DY/dt = Vapa Y and we could reverse our definitions and 
steps above. 

Finally we note that although there is a partial duality of roles of X and 
Y in the symbol Vx Y, which in fact defines an R-bilinear mapping of 
¥(M) x ¥(M) > ¥(M) by (X, Y) > V, Y, actually there is an important 
difference in the roles of X and Y. Namely this mapping is C*(M)-linear in 
the first variable (Exercise 3) but not the second [by (2) of Theorem 2.11]. 

In this connection we observe that when X and Y are vector fields on M, 
then the Lie derivative Ly Y = [X, Y] gives a rate of change or derivative of 
Y in the direction of X. However, this derivative requires a vector field X, 
not just a vector X „at a single point as does Vy, Y. Thus the two concepts of 
differentiation are essentially different. Property (3) of Theorem 2.11 gives 
their precise relationship. 


Exercises 


1. Prove Lemma 2.1. [Hint: The ideas in the next exercise may help.] 

2. Let U, ọ be a coordinate neighborhood on M c R" and let Fy,..., Em 
be the coordinate frames on U—as in (2.6). If pe U, then show that we 
may complete these frames to C® frames Fy,..., Fmn+1s--- Fn of R” on 
some neighborhood V c U of p. (This means that the components of 
these vectors relative to the frames of R” are C* functions of the local 
coordinates of M.) Using the Gram-Schmidt orthogonalization process, 
show that there is a C* orthonormal frame field F},..., Fp on this 
neighborhood V such that for each k = 1,..., n, the vectors F},..., Fi, 
and F,,..., F, span the same subspace. Use the new frames to give 
expressions for 2’, n” on V. 

3. Prove that if X, Y e ¥(M) and fe C*(M), then V;y Y = f Vy Y. Find the 
correct formula for Vy(fY). 
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4. Let M be a hypersurface of R” (submanifold with dim M = n — 1). 
Show that if p(t) is a curve on M c R" and Z(t) = Z „œ is a vector field 
along p(t) such that ||Z(t)|| = 1 and Z(t) is always normal to M [that is, 
orthogonal to T,)(M)], then DZ/dt is tangent to M. 

5. Let M be the right circular cylinder x? + y? = a? in R®. Find all curves 
p(t) = (x(t), y(t), z(t)) on R? which are geodesics, that is, for which 
(D/dt)(dp/dt) = 0. 

6. Let y=f(x), a < x < b, be a curve in the xy plane and let M c R? be 
the surface obtained by revolving the curve around the x-axis. Assume 
f(x) is C? at least and that f(x) > 0 on the interval. Show that the curve 
y = f(x), z = 0, and each curve into which it rotates is a geodesic. Deter- 
mine conditions for the intersection of M and a plane x = constant to be 
a geodesic. 


3 Differentiation on Riemannian Manifolds 


We now pass to consideration of abstract Riemannian manifolds— 
manifolds which are not submanifolds of Euclidean space. Our purpose is to 
develop a satisfactory theory of differentiation on such manifolds, having 
properties like those discussed above but intrinsically defined, that is, with- 
out imbedding M in Euclidean space. We will reverse the order of ideas in 
the last section and begin by an attempt to define for M a derivative Vy, Y of 
a vector field Y in the direction of a tangent vector X, to M at p. Of course, 
we use the properties discovered in the previous section as our model. In all 
that follows we shall suppose all vector fields and functions on M to be C”. 


(3.1) Definition A C® connection V on a manifold M is a mapping 
V: ¥(M) x ¥(M) > ¥(M) denoted by V: (X, Y) > Vy Y which has the lin- 
earity properties: For all f, ge C°(M) and X, X’, Y, Y' e ¥(M), we have 


(1) Vyx+ex Y =f (Vx Y) + g(Vx Y), 
(2) Vx(f¥ + gY’) =fVx Y¥ + 9Vy ¥' + (Xf)Y + (Xg)Y'. 


One should not attempt to read anything into the word “connection ”— 
it is just an operator like the directional derivative. Note the asymmetry in 
the roles of the first and second vector fields X and Y; V is C®(M) linear in 
X but not in Y. However, if fis a constant function, then Xf = 0; thus V is 
R-linear in both variables. Of course, we do not know that connections of 
this type exist, although by Theorem 2.11 they do for M imbedded in 
Euclidean space. In addition, according to that theorem we have in this 
special case two further properties: 


(3) [X, Y] = Vy Y — Vy X (symmetry), and 
(4) X(Y, Y’) = (Vx Y, Y’) + (Y, Vx Y’). 
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(3.2) Definition A C® connection which also has properties (3) and (4) is 
called a Riemannian connection. 


Note that in these definitions it is only property (4) that involves the 
Riemannian metric; thus on arbitrary differentiable manifolds one may 
study C® connections [properties (1) and (2)] or symmetric C” connections 
[properties (1)-(3)]. However, only the Riemannian case will be of interest to 
us. 


(3.3) Theorem (Fundamental Theorem of Riemannian Geometry) Let 
M be a Riemannian manifold. Then there exists a uniquely determined Rieman- 
nian connection on M. 


We prove this theorem in several steps in a manner somewhat similar to 
that of the existence proof for the operator d on /\(M). Before doing so we 
deduce a consequence of the definition of connection which will resolve a 
minor discrepancy with the last section. In the discussion of differentiation 
on manifolds imbedded in R” we defined the map X,—>Vy,Y from 
T,(M) > T,(M) using the vector field Y but without any assumption that X, 
was the value at p of a vector field X. However, given vector fields X and Y. 
a vector field Vy Y was then defined by (Vx Y), = Vx, Y for pe M, thus 
obtaining a map V of pairs (X, Y) of vector fields to a vector field Vy Y, as in 
our present definition. We have now taken this map on pairs of vector fields 
as the primary notion, and we wish to see that conversely, Y defines a linear 
map of T,(M) > T,(M) for each pe M, that is, to see that (Vy Y), depends 
not on the vector field X but only on its value X, at p. The same is not true of 
the dependence on Y as will appear in the corollary below. 


(3.4) Lemma Let X, Ye ¥(M) and suppose that either X = Qor Y = 0 on 
an open set U c M. If V is a connection [satisfying properties (1) and (2) of 
Definition 3.1}, then the vector field Vy Y = 0 on U. 


Proof Suppose that Y = 0 on U and q € U. There is a C” function f on 
M with supp(f) c U and f =1 on a neighborhood V of q (cf. proof of 
Theorem V.4.4). Note that V c supp(f) c U. Then since Y =0 on U, 
fY = 0on M. However, property (2) implies that Vy takes the 0-vector field 
to 0; therefore V,(f Y) = 0 on M. But then, using property (2) again we have 


0 = (Val f¥))q = (Xa f%e + SGV Vg = Vx Y) 


Since q is an arbitrary point of U, this completes the proof when Y = 0 on 
U. A parallel proof using property (1) applies when X = 0 on U. | 


In fact, since Vy Y is C?(M)-linear in X, essentially the same arguments 
imply a stronger property. 
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(3.5) Corollary Let p be any point of M. If X,X'eX(M) such that 
X, = Xp, then for every vector field Y, (Vx Y), = (Vx Y),. Denote this 
uniquely determined vector by Vx, Y. Then the mapping from T,(M) > T,(M) 
defined by X, > Vx, Y is linear. 


Proof Let U,@ be a coordinate neighborhood of the point p. As in the 
proof of the lemma, there is a C” function on M with supp(f) = U and f = 1 
on a neighborhood V of p (so V c U). If X e ¥(M), then on U we have 


X = Y aE; 
i=1 
with a;e C*(U) and E,,..., E, the vectors of the coordinate frames. We 
define X, E,,...,£,¢%(M) and d,,...,4,,¢C*(M) by X¥ = f?X, E, = fE, 
and a; = fa;, i = 1,...,n, on U, and all to be zero (vectors and functions 


respectively) on the open set M — supp( f). Then we have 
Xx = ă È, ape 8 oe a,E, 


on all of M; but on V this reduces to the equation above since X = X, 
E, = E; and @, = a, on this set. Applying Lemma 3.4 and property (1) of V 
gives 


Hence 
(Vx Y), = > a(p)(Ve, Y), = y a;(p)(Ve, Y), > 


where the right-hand side depends only on the value X, of the vector field X 
at p. This proves the first statement and the formula itself shows that the 
mapping X, > Vx ,Y = (Vx Y), is a linear mapping of T,(M) into itself. For 
its value depends linearly on the components a,(p), ..., a,(p) of X, relative 
to the basis E,,,..., Enp of T,(M). | 


An important consequence of Lemma 3.4 is that it allows us to define 
(unambiguously) the restriction VY of a connection V defined on M to any 
open subset U c M. This is done as follows. Let X, Y be C’-vector fields on 
U and let peU. As in the proof of the corollary we define X, Ÿ to be 
C®-vector fields on M such that £ = X and Y = Y ona neighborhood V of 
p. Then define (VX Y), =(VzY),, qe V. A different choice of ¥, Ÿ gives a 
different V, but by the lemma the value of VzY will be the same on the 
intersection of the two V’s. Since p is arbitrary, this defines a unique C® 
vector field Vy Y on U. It is easily verified as an exercise that V} is a 
connection and is Riemannian if V is—using the induced Riemannian metric 
on U. The importance of this follows from the next lemma. 
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(3.6) Lemma Suppose that a Riemannian connection V exists for every 
Riemannian manifold. If it is unique for manifolds covered by a single coordin- 
ate neighborhood U, then it is unique for all manifolds. Conversely, if there 
exists a uniquely determined (Riemannian) connection VY for every Rieman- 
nian manifold covered by a single coordinate neighborhood U, then there exists 
a uniquely determined Riemannian connection V on every Riemannian 
manifold. 


Proof We suppose that V is a Riemannian connection on M. By 
hypothesis there is a uniquely determined Riemannian connection Ve on 
each coordinate neighborhood U, œ of M (with the induced Riemannian 
metric). Let X, Y be vector fields on M and denote by Xy, Yy their restric- 
tions to U. It is an easy consequence of the definition of VY, the restriction of 
V to U, that V¥,, Yy = (Vx Y)y. Then on each coordinate neighborhood we 
have (Vx Y)y = V¥, Yy for, by the uniqueness assumption, Vv! = VY, Since 
M is covered by coordinate neighborhoods, this proves the first statement. 

Now suppose that V” is uniquely determined on every coordinate neigh- 
borhood U, ¢ of M. If there is defined on M a V with properties (1)-(4) it 
must be unique by the above. We shall define V on M as follows: Let 
X, Y e ¥(M) and let pe M. Choose a coordinate neighborhood U, ¢ con- 
taining p and define (Vy Y)y = V¥, Yy . This defines Vy Y not only at p but 
on the neighborhood U. It is easy to verify properties (1)-(4) since they hold 
for V'. Suppose V, W is a coordinate neighborhood overlapping U; let 
W = UAV. Then W is a coordinate neighborhood using either coordinate 
map ¢ or y and, V” being thus uniquely defined, we have at every point q of 


Ww 
(V, Yu)a = (VÝ, Yw)q = (Vi, Ya . 
This completes the proof of the lemma. | 


Proof of Theorem 3.3 The proof of the existence and uniqueness of a 
Riemannian symmetric connection, Theorem 3.3, is now reduced to the case 
of a manifold covered by a single coordinate neighborhood. Let U, p cover 
the manifold M and let x!,..., x" denote the local coordinates and 
E,,..., E, the coordinate frames. Denoting the inner product by (X, Y) we 
have as components of the metric tensor the C® functions g,(q) = (Eiq, Eja) 
on U = M. The matrix (g;,(q)) is symmetric, positive definite, hence it has a 
uniquely determined inverse (g'/(q)) whose entries are C” functions on U 
also. We shall show that there exists a unique Riemannian connection V on 
M. First we note that if V can be defined at all, then by properties (1) and (2) 
it is determined by the C” functions ri, on U,1 < i,j,k < n, defined by 
Ve E; = X2, I$ Ep. In fact, if X = È b'(x)E; and Y = } a'(x)E; on U, 
then from (1) and (2) and our definition of T$, 


ver => (xa +¥ Pyat) Be 
k id 
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Conversely, given functions Ti, on U, this formula defines a C* connection 
satisfying (1) and (2). 

However, the If, are not arbitrary C% functions since a Riemannian 
connection satisfies the further properties (3) and (4). Because [E;, E,] = 0 
for the coordinate frames, property (3) is equivalent to 


0 = [E; , E] = Vg E; Vg Ei = X (rk = T}; )Ex 


k 


or, in fact, to the symmetry of T$, in the lower indices: 
(3) T= Th 
Finally, property (4) is equivalent to 
Egi; = EAE; , Ej) = (Vz, Ei, E;) + (E; , Ve, E;) 
or to 
(4) Egy =}, Viigo + Vijgs)  1<ijk<n. 


Finally, using the matrix (g?) inverse to (g;;), we define Ti, = $, T} gs 
which implies that If; = } T;;g*. Thus the n? C® functions Ik, determine 
the n? C” functions T; and conversely. Properties (3’) and (4') become 


(3”) Vin = Vix 
and 
(4”) 6g;;/Ox* = [ky + Pays 


respectively, if we write E,g,; = 0g;;/0x*, that is, if we consider g;j as func- 
tions of the local coordinates. 

In summary, given a Riemannian connection on M, covered by a single 
coordinate neighborhood, then if a Riemannian connection V exists, it deter- 
mines n? functions F; of class C® which have the two properties just 
mentioned. Conversely, it is easy to check by reversing these steps that any 
such functions determine a C® Riemannian connection on M. Thus the 
theorem is completely established by the following lemma. 


(3.7) Lemma Let W be an open subset of R" and let (g;;) be a symmetric, 
positive definite matrix whose entries are C” functions on W. Then there exists 
a unique family of C” functions T(x), 1 < i,j,k < non W satisfying the two 
sets of equations (3") and (4"). 
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Proof Write Eq. (4”) twice more, each time permuting i, j, k cyclically. 
Then subtract the second of these equations from the sum of the first and 


third. Using (3”), Tix = Uj, gives the unique solutions 


1/69; Jk Ogri 
Di = Al ee +), 


Oxk ôx! ~— Gx! 


This completes the last step in the proof of the fundamental theorem 3.3. 


Suppose that U, is a local coordinate system with coordinates 
x!,..., x". Let E,,..., E, be the coordinate frames and Y = X aE, be the 
expression on U of the vector field Y. If pe U and X, = Y bE, then we 
have the following formula for Vy Y on U. 


(3.8) Corollary For each pe U, using the above notation, we have 


k 
(Vx), A =E (x Std rjav) E 
RANG 2 ij 


with 


1 095 «O89: ôg; 
re =- ksj Zsi _ Jij js | 
z. 2 » 4 te ox® + a) 


Proof As we have seen in the proof (V x Y)y is the same as Vi, Yy, that 
is, VY on X, Y restricted to U. For this reason we use the same symbol V for 
all cases. The formula of the corollary follows at once from applying proper- 
ties (1) and (2) defining a connection to Vs we,(), a‘ E,). 


Of course, this is the same formula we obtained earlier in the proof of 
Theorem 2.11 for a manifold M in Euclidean space. In fact we have an 
obvious corollary of the uniqueness of V: 


(3.9) Corollary In the case of an imbedded (or immersed) manifold in 
Euclidean space, the differentiation defined in Theorem 2.11 depends only on 
the Riemannian metric induced by the imbedding (but is otherwise independent 
of the imbedding). 


(3.10) Remark In Sections 1 and 2 we used the concept of differentiation 
of vector fields along curves dY/dt to define DY /dt and then Vx Y on sub- 
manifolds of R”. In this section we showed quite independently of the earlier 
discussion that there is a uniquely determined Riemannian connection V on 
every Riemannian manifold M. Using this result we come full circle and 
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define—for a vector field Y and curve p(t) on M—the covariant derivative 
DY/dt of Y(t) = Ypy by 


DY 
dt T Vapiat Y 
Let Y be given locally by Y = Y b*(x)E, and p(t) by x(t) = = (xi(t x”(t)). 


Then from Corollary 3.8, with X pn = $ (t )E; = Fay it is ie 10 rede- 
rive formula ie (renumbered here) 


611) E COPED) lk. 


Since ro depend only on the values of b!, ..., b”, components of Y 
along the curve, the formula is valid when Y is defined only at points of the 
curve. Of course on any interval of the curve Y may be extended to a vector 
field on M, but DY/dt is independent of the extension by (3.11). 


Constant Vector Fields and Parallel Displacement 


A vector field Y on M is said to be constant if Vy, Y = 0 for all pe M and 
X E T,(M). In general there do not exist such vector fields, even on small 
open subsets of M. However, given a differentiable curve p(t), 0 < t < T, 
there will be a vector field X(t) = X pẹ constant or parallel along p(t) (by 
which we mean DX/dt = 0). 


(3.12) Theorem Let p = p(0), the initial point of the curve p(t),0 < t < T, 
and let X , € T,o)(M) be given arbitrarily. Then there exists a unique constant 
vector field X yt) along p(t) such that X „o; has the given value. If F,,,..., Fup 
is an orthonormal frame at p(0), then there is a unique, parallel field of ortho- 
normal frames on p(t) which coincide with the given one at p = p(0). 


Proof The proof depends on the existence theorem (Theorem IV.4.1). 
We shall first demonstrate that if tọ is any point of the interval [0, T], and 
X pao) iS an arbitrary tangent vector to M at p(t»), then there exists an 
é-interval around to such that there is a unique constant vector field X pin 
taking the value X pro), at p(to) and defined at each p(t) on the intersection of 
(to — £, to + £) and [0, T]. To see this, we first choose 6 >0 such that for 
to— <t < tọ + 0, p(t) lies in a single coordinate neighborhood V, 9. By 
(3.11), Xan = ¥ SOE, pt) Will have the desired properties on some interval 
(to — & to + £) if and a if a(t) T 


Tha), k=1,... 
-È do Z, sexda 


and the initial conditions 3 a*(to)Erpao) = X pto» AS given. In this system of 
first order equations, the If, depend on t through p(t), and x‘(t) = (p(t) are 
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the local equations of the curve p(t). By Theorem IV.4.1, functions a*(t) with 
these properties exist and are unique on some interval around tọ. Moreover, 
their differentiability class is at worst one less than that of p(t) (since this is 
true of the derivatives dx'/dt). 

We further note that if an orthonormal frame F; pto)» + - -> F apts) iS given at 
p(to), it can be extended on p(t) as n individual constant vector fields over 
some ¢é-interval around tọ. But then for each i, j 


D DF, DF) _ 
nfo d= a i) + (Fo i) =o 


Hence (Fipo, Fip) is constant along p(t) on tp — €< t < to + € But since 
(F ipto Fipa) = Sij this holds on the interval, i.e., F1 pa)» - -> Fapa ÍS a constant 
orthonormal frame field on (tọ — €, to + £). 

It follows that beginning at tọ = 0, the conclusions of the theorem follow 
for all t < t4}, for some t, < T. If t, < T, this leads to a contradiction: We 
have, for example, X „n, a constant vector field defined for all 0 < t < t,. Let 
Fipo- -» Fnp be a constant orthonormal frame field defined on t, — < 
t < t, + £ for some € > 0 and let X py = >, b* (t)F xp) for the overlapping set 
(t; — e, t,). Since X pn and the frame field are both constant, the components 
b*(t) are constant; hence X pt) May be extended to the entire interval (t; — €, 
tı + £) by simply letting its components relative to the orthonormal frame 
field be these same constant values. Thus t, = T and the result is proved. f 


(3.13) Remark We remark that it is sufficient for the curve to be piece- 
wise differentiable, for then we can move X, along each piece separately. 
Therefore it follows from this theorem that given a piecewise differentiable 
curve p(t), there exists an isomorphism, in fact isometry, 1,: T,«o)(M) > 
T (M) determined by the condition that t,(X po) be a parallel (constant) 
vector field along p(t). It is clear from our initial discussion of dX /dt along a 
curve p(t) in Euclidean space that this would enable us to define the deriva- 
tive of vector fields along curves on a Riemannian manifold M by comparing 
vectors at different points of the curve. The notion of parallel displacement 
along curves is sometimes taken as the starting point in studying differentia- 


tion on manifolds. 
Exercises 


1. Show that if E,,..., E, is a parallel frame field along a differentiable 
curve p(t) in M and X(t) = X , is a vector field along the curve defined 


by X(t) = an a'(t)E ipa» then 


n da 
= 2, dt Eip - 


2. Using spherical coordinates we may cover the 2-sphere S of fixed radius 
a minus a single meridian from north to south pole by a single coordin- 
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ate system U, go. The parameter mapping g' takes W = {(u’, u°) | 
0 <u! < 22,0 < u? < n} onto the sphere as imbedded in R? by ø: 


ge} (u!, u?) = (a cos u' sin u?, a sin u’ sin u?, a cos u’). 


Find g;,(u', u°) and compute T'j,(u’, u’). 

3. Let the upper half plane be considered as a manifold M covered by a 
single coordinate system M = {(x!, x?) | x? > 0} with U = M and coor- 
dinates (xt, x). If g(x) = (x°)? 6,,, find T}. 

4. Show that isometries of Riemannian manifolds preserve the Riemannian 
connection, that is, if F: M, > M, is a diffeomorphism preserving the 
Riemannian metric, then F (VY? Y) = VP uy F(Y). 

5. Let M bea Riemannian manifold, W an open neighborhood of (uo, vo) 
on the uv plane, and F: W + M a C” mapping. Let @F/éu and 6F/év 
denote the vectors tangent to the curves v = constant and u = constant, 
respectively, and let D/du, D/ðv denote the covariant derivatives of any 
vector field along these respective curves. Using (3.11), show by direct 
computation that 

DCF _ DCF 
vôu ĉu dv 


To which property of the Riemannian connection does this correspond? 


4 Addenda to the Theory of Differentiation on a Manifold 


In this section we insert a brief treatment of two topics which are closely 
related to the previous section, but which we do not need or use until the 
next chapter. First, we introduce the Riemann curvature tensor, and second, 
we briefly treat connections from the point of view of exterior differential 
forms. 


The Curvature Tensor 


It is a standard theorem of advanced calculus that second-order partial 
derivatives are independent of the order of differentiation: 


ô ( ôf ) _ 6 (2 

axi loxi] ax exif’ 
For functions on manifolds the analogous property X(Yf) = Y(Xf) does 
not hold in general. Indeed [X, Y] measures the extent by which it fails: 


X(Yf) — Y(Xf) = [X, YF 


[It still holds if X = E; and Y = E,, since E, f may be identified with Of/6x*, 
k = 1,...,n, if we allow f to denote the expression for the function on M in 
local coordinates x', ..., x”.] 
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Since interchangeability of order of differentiation is measured by an 
interesting object [X, Y] in the case of functions, it is natural to study the 
same question for Vy and Vy derivatives of a vector field Z on M with 
respect to vector fields X, Y. A relatively simple example (Exercise 1) shows 
that in general Vy(Vy Z) — Vy(Vx Z) # 0; hence it determines a vector field 
on M which may be thought of as analogous to LX, Y]. In fact, however, a 
more important expression, which involves also the measure of noninter- 
changeability of derivatives of functions [X, Y], is the following related 
vector field, denoted by R(X, Y)Z or R(X, Y): Z: 


(4.1) R(X, Y): Z = Vy(Vy Z) — Vy(Vx Z) — Vix, Z- 


It is readily verified that this formula defines a multilinear mapping of 
X(M) x ¥(M) x ¥(M) > X(M), that is, R(X, Y): Z is R-linear in each 
variable. However, from another point of view, in this expression R(X, Y) is 
an operator, determined by the vector fields X and Y, and assigning to each 
vector field Z a new C®-vector field R(X, Y): Z. Note that if [X, Y} = 0, as 
is the case when X = E;, Y = E; are vectors of a coordinate frame, then 


R(X, Y): Z = Vx(VyZ) — Vy(Vx Z), 


so that if R(X, Y) = 0 on M, then Vz, and Vz, are interchangeable for all Z. 
A purely formal reason for the added term —V,,, n Z in the definition is so 
that the following important theorem holds. 


(4.2) Theorem At any point p, the vector (R(X, Y)- Z), depends only on 
X ne YpsZps the values of the three vector fields at p, and not their values in a 
neighborhood or on M; thus formula (4.1) assigns to each pair of vectors 
X,, Yp€ T,(M) a linear transformation R(X „, Y,): T,(M) > T,(M). In fact, 
(X,, Yp) > R(X,, Yp) is a linear mapping of T, (M) x T (M) into the space of 
operators on T,(M). 

Proof From the definition of R(X, Y): Z we see that it depends R- 


linearly on each of the three arguments X, Y, Z. Moreover if f is a C* 
function on M (not necessarily constant), we have 


R(fX, Y): Z = R(X, fY): Z = R(X, Y): fZ = R(X, Y): Z 


as we may easily check by direct computation. 

Now suppose that U, ọ is a coordinate neighborhood. Let (x’,..., x") 
denote the local coordinates and E,,..., E,, the coordinate frames. We sup- 
pose that X = X «'E;,, Y =) P/E,;, Z = } yE. Then by the remarks 
above, 

R(X, Y): Z = } a'Bly"R(E;, Ej) ° Ex 


i jek 
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and we see that at a given point p of U the right-hand side involves first 
R(E;, E;); Ep, which is independent of the vector fields, and second the 
values of the functions «', p, y' only at the point p itself, not at nearby points. 
This proves the theorem. We used only properties (1) and (2) of the connec- 
tion V, but the next fact uses the Riemannian metric. i 


(4.3) Corollary The formula R(X, Y, Z, W) = (R(X, Y): Z, W) defines a 
C®-covariant tensor of order 4. This tensor depends only on the Riemannian 
metric on M: If M,, M, are Riemannian manifolds and F: M, > M, is an 
isometry, then F*R, = R,. 


Proof Since R(X,, Y,)* Z, is defined as an element of T,(M) for any 
pe M, its inner product (R(X,, Y,): Zp, W,) with any W,e T,(M) is a well- 
defined real number. Thus for each p, R,(X,, Yp Zp, W) = 
(R(X,, Yp): Zp, Wp) defines a multilinear function of four variables on 
T,(M j, ‘that is, an element of 7 *(T,(M)). This clearly defines a C*-tensor 
field since both inner product and R(X, Y)-Z are C” for 
X, Y, Z, We ¥(M). 

We have defined an isometry of Riemannian manifolds to be a 
diffeomorphism which preserves the Riemannian metric, that is, 
F,: T,(M1) > Trl M2) preserves inner products (and is an isomorphism 
onto). [If we do not suppose that the C® mapping F is one-to-one onto, but 
only that F, is onto and preserves inner products, then it is called a local 
isometry. It is an isometry on some neighborhood of each point (for exam- 
ple, covering spaces). The last statement of Corollary 4.3 is valid for local 
isometries also.] Now since V is uniquely determined by the Riemannian 
metric, F, preserves the connection, more precisely F (Vx Y) = 
Vi,a)F,(Y). From this we deduce that R,(F,X,F,Y) FZ = 
F (R(X, Y)- Z). Since inner products are preserved, this implies F*R, = R,. 

| 


(4.4) Definition The operator R(X, Y) is called the curvature operator and 
the tensor R(X, Y, Z, W) is called the Riemann curvature tensor. [It is not 
difficult to see that each one determines the other (Exercise 2).] 


(4.5) Remark Let £,,..., E, be a field of frames on U, an open set of M. 
Then the Riemann curvature tensor is uniquely determined on U by either of 
the nî sets of functions R4; or Rj; defined by the equations 


R(E; , E)E; = 2 RE; 


and 


R(E; , E, , E; , E;) = Rijs = È Ijs Ria, Gis = (Ej, Es). 
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The Riemannian Connection and Exterior Differential Forms 


There is another way of formulating the properties of covariant deriva- 
tives, connections, curvature tensor, and so on, which we shall now touch 
upon—it will be more fully treated later. We suppose that U is an open 
subset of a manifold M which has defined over it a field of C* frames 
E,,..., E,. The most usual case is when these are the coordinate frames of a 
coordinate neighborhood U, @. However, in the case of a Riemannian mani- 
fold, which is our present interest, we might find it convenient to consider a 
neighborhood with orthonormal frames (Exercise 4). Corresponding to 
E,,..., E,, we have at each pe U the dual basis 8+, ..., 0” of T*(M), charac- 
terized by 6‘(E;) = ô}. It is a field of dual coframes on U and is clearly C*. If 
6’, ..., 0” are given, then conversely E}, ..., E, are determined (Section V.1). 

Now in defining Vy Y on a manifold so as to satisfy properties (1) and 
(2), we saw that it is enough to know Vg, E;; for Vy Y may then be computed. 
In fact, Vz, E; = >), Vk, Ep, and we determined the If, above. If a connection 
is given so that Ij; are known on U, then we may define n? one-forms 6} by 
o = >, F0. Conversely, given these one-forms, then If, = 6¥(E;), and 
hence Vz, Ej, and the connection is determined. Indeed one checks at once 
that Vy E; = $ 0)(X)E,, that is, the values of the forms 0},..., 07 on X are 
the components of Vy E; relative to the given frames. Therefore, given U and 
0t, ..., 0" a field of coframes on U, then the connection is determined on U 
by the n? forms 64. They are called the connection forms. 

Of course, the n? connection forms 6% are not arbitrary, they must satisfy 
certain conditions corresponding to properties (1)-(4) of Definition 3.1 if 
they are to determine a connection on U, especially in the case of a Rieman- 
nian connection—the one we are interested in. We have the following 
restatement of the fundamental theorem of Riemannian geometry in terms 
of forms—-although we restrict ourselves only to the case in which the mani- 
fold is covered by a single coordinate neighborhood or more precisely a 
neighborhood on which is defined a frame field. 


(4.6) Theorem Let M be a Riemannian manifold such that it has a covering 
by a C” field of coframes 0', ..., 0". Then there exists a uniquely determined 
set of n? C® one-forms ©, 1 < j, k < n, on M satisfying the two equations 


(i) da — >, 0/6; =0, 

(ii) gi; = Dix (Oig + Ogu) 
where gi; = (E;, E;), with E,,..., E, the uniquely determined field of frames 
dual to 0,,...,9,. The forms O} so determined define the Riemannian connec- 
tion satisfying properties (1)-(4) of the fundamental theorem by the formulas: 


(iii) VyE, = Ð OM(X)E,, and 
(iv) Vx(fY) = (XP)Y + fVx Y, fe C*(M). 
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Conversely, the Riemannian connection determines 6 as explained above 
and these O; satisfy (i) and (ii). 


The proof is basically a duplication of that of the fundamental theorem 
[Theorem 3.3] in the case of manifolds covered by a single coordinate neigh- 
borhood (see Exercises 6-8). It can then be extended to general M in similar 
fashion. 

If we recall that a Riemannian manifold M of the type described may be 
covered by an orthonormal frame field E,,..., E, with g; = (E;, E;) = 6,, 
then we have a nicer version of the above. In this case we denote 6' by wi and 
6; by œ$. Using the fact that g;; = ô; (and hence dg,; = 0), we obtain the 
following corollary: 


(4.7) Corollary Let M be a Riemannian manifold which has a covering by a 
field w',...,@" of coframes whose dual frames E,,..., E, are orthonormal. 
Then there exists a unique set ofn? one-forms wk, 1 < j,k < non M satisfying 


(i) dœ — Y}; ara = 0, 
(ii) wf + œj =0. 


These w} determine the Riemannian connection (as above) and conversely. 


Finally, we note that since 0% are uniquely determined by 6',..., 6" and 
the Riemannian metric—the coframe field and the metric—then the 
exterior derivatives d0% are also uniquely determined, as are their expres- 
sions as linear combinations of the basis 6’, @/, 1 < i < j < n, of two-forms 
on the domain U of 6',..., 6". As we shall see in the next chapter, the 
coefficients in these linear combinations determine the components of 
the curvature tensor. 


Exercises 


1. Using Vg, E; = }, I% E, and Exercise 3.3, show that for the metric 
gy = (x7)? ô; on {(x*, x?)]x? > 0}, 


Ve (Ve; Ej) - Vz; (Ve, E;) + 0. 


2. Show that R(X, Y, Z, W) determines R(X, Y) - Z, that is, if the values of 
the former on all vector fields are known, the same holds for the latter. 

3. For a local coordinate system, compute Rj, and R;jų in terms of T% 
and gij. 

4. Show that w’,...,w"e /\'(M) satisfy the hypotheses of Corollary 4.7 if 
and only if the Riemannian metric is defined by the tensor Xi w R wi. 

5. Suppose that F: ¥(M) x © x ¥(M) > ¥(M) is a C”(M)-multilinear 
mapping of k vector fields (X,,...,X,) on M to a vector field 
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F(X., ---, X;) [C”(M) linear in each vector field separately], M a 
Riemannian manifold. Show that 


D(X i,- Xe, X1) = (F(X r -3 Xe) Xi41) 


defines a covariant (k + 1) tensor. 

6. Use Lemma V.8.4 to show that [X, Y] = Vy Y — Vy X is equivalent to 
property (i) of Theorem 4.6. Prove that property (ii) is equivalent to 
X(Y, Z) = (Vx Y, Z) + (Y, Vx Z) if we use Xf = df (X). 

7. Prove directly, using differential forms, that there exists one and only 
one set of forms 6% satisfying (i) and (ii) as asserted in Theorem 4.6. 

8. Complete the proof of Theorem 4.6 using the results of Exercise 7 by 
showing that (iii) and (iv) define a Riemannian connection on M as 
claimed. 


5 Geodesic Curves on Riemannian Manifolds 


As a first example of the use of covariant differentiation on a Riemannian 
manifold—we shall define and study the class of curves called geodesics. Let 
p(t) be a curve on M and dp/dt its velocity vector, defined for some open 
interval a < t < b of R; we suppose it to be of class C? at least. 


(5.1) Definition The (parametrized) curve p(t) is said to be a geodesic if its 
velocity vector is constant (parallel), that is, if it satisfies the condition 
(D/dt)(dp/dt) = 0, the equation of a geodesic, for a < t < b. 


As we saw previously, when M = R" with its usual metric this implies 
that the curve is a straight line. But in Section 2 it was seen that for a 
submanifold of R” this can mean something quite different, an example being 
the great circles on S"7! < R”. 

The parameter on a geodesic is not arbitrary; the fact that a curve isa 
geodesic depends both on its shape and its parametrization as we may see 
from the example of a (geometric) straight line in R* given parametrically by 
x! = 3, x? = t’. We write p(t) = (t°, t°); then 


dp z Ê i 
AAE 3 L. 
a oe ee 


Since D/dt = d/dt in R’, we have 
Did D(,D 2 ĉ ĉ ô 
2 (4?) = (322. cn fer peer eae a) 
aa) AE ol ee) ae 


Therefore this curve is not a geodesic although the path traversed is the line 
x! = x?. If p(t), O < t < b, is a nontrivial geodesic (not a single point), then 
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the permissible parametrizations—those with respect to which it remains a 
geodesic—are given by the following lemma. 


(5.2) Lemma Let p(t), a < t < b, be a nontrivial geodesic and let t' be a 
new parameter. With respect to t' the curve will be a geodesic if and only if 
t = ct' + d, c + O and d constant. In particular, the arclength is always such a 
Parameter. 


Proof If we introduce a new parameter t by t = ct’ + d, c + 0, then 
dp/dt' = c dp/dt and (D/dt')(dp/dt') = c?(D/dt)(dp/dt) = 0; so the curve re- 
mains a geodesic relative to t’. Now let s be arclength measured from P(to), a 
point of the curve. Then ds/dt = ||dp/dt||. Since dp/dt is constant along the 
curve, by (2.3c) its length ||dp/dt|| is constant. Either ||dp/dt| is identically 
zero with p(t) a single point and s = 0 or else ds/dt = ||dp/dt|| = c, a nonzero 
constant, and s = ct + d. This means that the curve is a geodesic when 
parametrized by arclength. Since any other permissible parameter is related 
to arclength by a similar (linear) relation, any two parameters are linearly 
related. i 


In order to make general statements about geodesics on manifolds we 
shall need to study the defining equation in some detail using the existence 
theorem (IV.4.1). We can, however, give a few further examples by virtue of 
the following two observations. First, the equation of a geodesic imposes 
only a local condition on the curve. More precisely, if each point of a curve C 
has a neighborhood in which it may be written in the form p(t), a < t < b, 
with (D/dt)(dp/dt) = 0, then it is a geodesic; for then, using arclength from 
some fixed point as parameter on all of C, it must satisfy the equation 
(D/ds)(dp/ds) = 0 over its entire length. Second, the property of being a 
geodesic is preserved by isometries because covariant differentiation is 
preserved and therefore so is parallelism of a vector field (for example, dp/dt) 
along a curve. 

Now we let 2:R*-—T? be the standard covering discussed in 
Example IIJ.6.15 and in Section III.9. We take R? with its usual Riemannian 
metric. Since the covering transformations are translations, they are isome- 
tries of R?. It follows that we may define on T? a Riemannian metric which 
makes the projection z a local isometry, meaning that z, is an isometry of 
each tangent space T,(R*) onto T,,,,(T7). With this metric the geometry of 
T? is locally equivalent to that of Euclidean space. [This Riemannian metric 
should not be confused with the metric induced on a torus imbedded in R? 
by the standard Riemannian metric of R°.] Combining our two observa- 
tions, it follows that even a local isometry, as for example this map 7, carries 
geodesics onto geodesics. This means that the images of straight lines of R? 
on T? are geodesics of T? (Fig. VII.10). In particular, lines of rational slope 
map to closed geodesics on T?, lines of irrational slope do not—they are 
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y (A 
R? 


Figure VII.10 


dense on T?. Nothing like this occurs in R?, where geodesics can be neither 
closed curves nor dense. Thus “straight lines” even on spaces locally isome- 
tric to Euclidean space present some fascinating variations from what we 
might expect. 

A similar, even simpler example is a right circular cylinder 
M = {(x, y, z)¢ R? | x? + y? = 1}. The Riemannian metric induced by that 
of R? is the same as that given by the covering z: R? > M (of Exercise 1). In 
this case the covering map is given by rolling up the plane into an infinite 
cylinder each strip of width 2r covering M once. Details are given in the 
exercises. 

In order to study properties of geodesics on a Riemannian manifold M, 
we pass to local coordinates (x',..., x") on a connected coordinate neigh- 
borhood U, g. Then by (3.11) the equation of a geodesic (D/dt)(dp/dt) = 0 is 
equivalent to the system of second-order differential equations: 


d?’ x“ n dx' dx! 
5.3 — T(x) — — = c= 1,..., 7. 
(5.3) qe? D K(x) wa 0,  k=1,...,n 


A solution is a curve given in local coordinates by n functions 
(x! (t), ..., x"(t)) which satisfy (5.3). As usual let E,, ..., Z, denote the coor- 
dinate frames. We may apply our existence theorem IV.4.1 to prove the 
existence and uniqueness of a geodesic through each pe U with prescribed 
tangent direction at p and study its dependence on p and the tangent 
direction. 


(5.4) Lemma Given any q€ U, we can find a neighborhood V of q with 
V c U and positive numbers r, ô such that for each pe V and each tangent 
vector X, =} b'E; with ||X,| <r, there exists a unique solution 
(x! (t), ---, x”(t)) of (5.3), defined for —5 < t < 6, which satisfies x'(0) = x'(p) 
and x'(0) = bi, i = 1,..., n. Let p(t) = @ (x(t), ..., x"(t)) as just defined. 
Then p(t)e U for |t| < ô. 
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Proof We consider the following system of 2n first-order ordinary dif- 
ferential equations defined on the open subset W = g(U) x R" c R" x R” 


= R”: 
dx* 
aa ~ k=1,... 
( ) dt y5, d > n, 
dy" es 
= — PY TEx)’, k=1,...,n. 
dt i,j=1 
The right-hand sides are C® functions of (x, y) = (xt, ..., x"; yt, ..., y") on 


W. Therefore, according to the existence theorem for ordinary differential 
equations cited above, for each point in W there exists a ô > Oanda neigh- 
borhood V of the point with the property that given (a;b)= 
(a',...,a";b*,..., b")e Ñ, there are 2n unique functions x* = f*(t, a; b) and 
y* = g*(t,a; b), k =1,...,nand |t| < ô, satisfying the system of equations 
(5.5) and the initial conditions f*(0,a;b)=a* and g*(0,a;b) = b, 
k = 1,...,n. These functions are C® in all variables and have values in W. If 
peU, we consider the point (¢(p); 0) = (x'(p),..., x"(p);0,..., O)e W. 
Then there is a 6 > 0 and a neighborhood V of (o(p), 0) as described. This 
neighborhood may be chosen to be of the form ọ(V) x B%(0) for some V 
with V < U compact and r’ > 0. Since V is compact, we may find a number 
r>0 such that if (È gi(x)bjb;)'? = |X | <r and peV, then 
(È (b'?)'? < r. This follows from the inequalities used in the proof of 
Theorem V.3.1. We see at once from the special nature of system (5.5) that 
df*/dt = g* and hence 
ay _ . df' df 
d? ie a dt’ 

In other words x*(t) = f*(t, a; b) are solutions of the system of equations 
(5.3), and therefore the equations in local coordinates of geodesics satisfying 
x*(0) = a* and (dx*/dt),_9 = b, k = 1, ..., n. Finally, according to the exist- 
ence theorem cited, the image of J; x V under the map 


(t, a, b) > (f*(t, a; b),..., f”(t, a; b); g'(t, a; b), ..., g”(t, a; b)) 
is in W which proves that p(t) = p '( f(t, a; b))e U. I 
The lemma has the following corollary, which guarantees the existence 


of a unique open geodesic arc through any given point with prescribed 
direction. 


(5.6) Corollary If M is a Riemannian manifold, pe M, and Y, is a nonzero 
tangent vector at p, then there is a 4 >0 and a geodesic curve p(t) on M 
defined on some interval —5<t<6, 5>0, such that p(0) = p, 
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(dp/dt),-) = AY,. Any two geodesic curves satisfying these two initial condi- 
tions coincide in a neighborhood of p. 


To see that this is so we take a neighborhood U, of pand choose A > 0 
so that || ¥,|| < r as in Lemma 5.4; then we apply the lemma. 


(5.7) Remark Since “being a geodesic” is a purely local property of 
parametrized curves—meaning that they satisfy a differential equation—it is 
clear that if two geodesic curves C, and C, coincide (as sets) over some 
interval, then their union—suitably parametrized—is a geodesic. Further, we 
now see that if two geodesics have a single point in common and are tangent 
at that point, then their union is a geodesic. This implies that each geodesic is 
contained in a unique maximal geodesic. A maximal geodesic is one that is not 
a proper subset of any geodesic: If it is parametrized by a parameter t with 
a< t< b, then a and b (which can be — œ and/or +0) are determined by 
the curve and the choice of parameter. It is not possible to extend the 
definition of p(t) (with the given parameter) so as to include either of these 
values and so that it will still be a geodesic. We shall be interested in 
determining conditions on M which ensure that a = —œ and b= + for 
every geodesic, or that every geodesic can be extended indefinitely in either 
direction. By Lemma 5.2 this property would be independent of parameter. It 
is easy to see that this is not always possible: let M be R? with the origin 
removed. Then radial straight lines cannot be extended to the origin. 
However, given a geodesic through a point p, clearly we can always 
reparametrize it so that p = p(0) and p(t) is defined for |t| < 2, say. Making 
use of this fact, we modify Lemma 5.4 slightly to obtain our basic existence 
and uniqueness theorem for geodesics. 


(5.8) Theorem Let M be a Riemannian manifold and U, @ a coordinate 
neighborhood of M. If qe U, then there exists a neighborhood V of q and an 
e > 0 such that if pe V and X „€ T,(M) with |X || < £, then there is a unique 
geodesic p(t) = p(t, p, X,) defined for —2<t< +2 and with p(0) = p, 
(dp/dt),-9 = Xp. The mapping into M defined by (t, p, Xp) > plt, p, Xp) is 
C® on the open set |t| <2, pe V, ||X,|| < £ and has its values in U. 


Proof According to Lemma 5.4, we may find a neighborhood V of q 
and numbers r, ô > 0 such that given any pe V and vector X „e€ T,(M) with 
||X,|| < r, then there is a geodesic p(t) defined for |t| < ô and satisfying the 
initial conditions p(0) = p, (dp/dt)o = X,. We know that if we change toa 
parameter t = ct’, c # 0 a constant, then p(t’) = p(ct’) is again a geodesic 
with p(0) = p and dp/dt’ = (dp/dt)(dt/dt') = c dp/dt; thus (dp/dt')y = cX,- 
Now if ô > 2, we may use £ = r and we have no more to prove; but ifd < 2, 
we let e = dr/2. Then, if pe V and X, is a tangent vector at p with |X,| <6 
we know from the choice of e that ||2X,/6|| < r. Thus there is a geodesic p(t) 
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with p(0) = p and (dp/dt)) = 2X,,/6 defined for |t| < ô at least. The curve 
p(t') = p(dt'/2) is again a geodesic and satisfies p(0) = p, (dp/dt')y = (5/2) x 
(dp/dt)) = X,. Moreover it is defined for |6t'/2|<6, that is, for 
—2 < ť < +2. This completes the proof; the last statement is already con- 
tained in Lemma 5.4. i 


Exercises 


1. Show that the mapping z: (u, v) > (cos u, sin u, v) carrying R? onto the 
cylinder M = {(x, y, z)|x? + y? = 1} c R? is a covering and a local 
isometry onto M with the induced metric of R?, from R? with the usual 
metric. 

2. Use Exercise 1 to show that the geodesics on M are the helices, that is, 
curves which cut each generator at the same angle (or have a constant 
angle with the z-axis), the generators themselves, and the circles of inter- 
section with planes z = constant. Find how many geodesics connect two 
given points p, q. 

3. Show that two isometries F,, F}: M > M of a Riemannian manifold 
which agree on a point p and induce the same linear mapping on T,(M) 
agree on a neighborhood of p. Can you improve this statement? 

4. Using the notation and Riemannian metric of Exercise 3.3, show that if 
suitably parametrized, the curves x' = constant are geodesics on M. 


6 The Tangent Bundle and Exponential Mapping. 
Normal Coordinates 


Although it may not have been apparent, the process by which we passed 
from a second-order system of equations (5.3) to a first-order system (5.5) 
when we first studied geodesics was to introduce new variables which corre- 
sponded to the components of tangent vectors at points of a coordinate 
neighborhood U, œ. These vectors X,, pe U, are in one-to-one correspon- 
dence with points (x; y) of the open set W = (U) x R" c R" x R". The 
correspondence, which we denote by @, is given by (X,)= 
(olp); y’, ..., y"), where o(p) = (x',...,x") are the coordinates of p, 
X,=) Ej, and E,,...,E, are the coordinate frames. The differential 
equations of geodesics (5.3) were interpreted as a system of first-order differ- 
ential equations (5.5) on W. Like all such systems, they correspond to a 
vector field on W (which we discuss in Section 7). 

In order to free ourselves from working exclusively with local coordin- 
ates, it is natural to try to think of W being the image under @ of a coordin- 
ate neighborhood U, @ on a manifold. This is possible. It requires that we 
define a manifold structure on the set of all tangent vectors at all points of 
M, which we shall denote T(M) (compare Section IV.2). When this is done, 

T(M) = {X,€T,(M)|pe M} = U T,(M) 


peM 
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will become a space, in fact a C% manifold, whose points are tangent vectors 
to M (compare Section I.5). In view of the introductory remarks, it is clear 
that we shall want the subset U consisting of all X, such that pe U to bea 
coordinate neighborhood with @ as coordinate map and W as image, 
: U > W. This virtually dictates the choice of topology and differentiable 
structure. Let z: T(M) > M be the natural mapping taking each vector to 
its initial point x(X,) = p; then x” ‘(p) = T,(M). 


(6.1) Lemma Let M be a C”-manifold of dimension n. There is a unique 
topology on T(M) such that for each coordinate neighborhood U, 9 of M, the 
set U = n“! (U) is an open set of T(M) and @: U > @(U) x R", defined as 
above, is a homeomorphism. With this topology T(M) is a topological manifold 
of dimension 2n and the neighborhoods U, @ determine a C®”-structure relative 
to which n is an (open) C”-mapping of T(M) onto M. 


Proof Let U, ọ and U’, g’ be coordinate neighborhood on M such that 
U n U’ 4 Ø; then U n U’ + Ø. Comparing the coordinates of pe U ^ U' 
and the components of any X,¢7,(M) relative to the two coordinate 
systems, we obtain the formulas for change of coordinates in Un Us 


` seir P i i : n of" n ; n 
Q o D(x", ...,X ora (S (x), -f (x); UY Lpa si) 


i=1 


where x" = f'(x!,..., x"), i = 1, ..., n, are the formulas for change of coor- 
dinates g'o 7! on UCU’ and the change of components is as in 
Corollary IV.1.8. These are easily seen to be diffeomorphisms of (0 ^ U’) 
onto @’(U n^ Ü’). The remainder of the verification is left as an exercise. Note 
that in local coordinates z corresponds to projection of R" x R” onto its first 
factor. We should also note that locally, on the domain U of each coordinate 
neighborhood of the type above, T(M) is a product manifold, that is, as an 
open submanifold of T(M), U is diffeomorphic to g(U) x R”. In the case of 
Euclidean space, U, œ may be taken to be all of M = R” so that T(R") is 
diffeomorphic to R" x R". It is clear that for every manifold M, 
dim T(M) = 2 dim M. I 


(6.2) Definition T(M) with the topology and C® structure just defined is 
called the tangent bundle of M, x: T(M) —> M the natural projection. 


Using Theorem 5.8, we may define Exp, the exponential mapping; its 
domain 2 is some subset of T(M). It is a nontrivial matter to characterize 
exactly what this subset is. However, the range of Exp is M itself, thus 
Exp: 2 > M maps a vector X, to a point of M. The name derives from the 
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exponential mapping of matrices (Section IV.6) for reasons which will be 
discussed later. Now let U, be a coordinate neighborhood of M and 
suppose qe U. If a neighborhood V of q and an ¢ > Q are chosen as in 
Theorem 5.8, then for each X, with pe V and ||X,|| < £, or equivalently, in 
the open subset {X,|peV, |X,|| <£} of T(M), the geodesic p(t) with 
p(0) = p and (dp/dt)) = X, is defined for |t| < 2. On this open set of T(M) 
we define Exp as follows. 


(6.3) Definition Exp X, = p(1), that is, the image of X „ under the expon- 
ential mapping is defined to be that point on the unique geodesic determined 
by X, at which the parameter takes the value +1. 


Thus each qe M has a neighborhood V such that Exp is defined on the 
open subset {X,|peV, ||X,| <e} = 2 '(V). (Note that € depends on q 
and its neighborhood V.) This information on 2 may be restated as follows: 
Let Mo be the submanifold of T(M) consisting of all zero vectors 0,,peM. 
Then p—0, maps M onto Mo diffeomorphically and x: My > M is its 
inverse. The application of Theorem 5.8 then guarantees that the domain F 
of Exp contains an open neighborhood of Mo in T(M). 

We also note that since |dp/dt|| is constant along a geodesic p(t), its 
length L from p(0) to p(1) is 


ntl 


Thus Exp X, is the point on the unique geodesic p(t) determined by X p 
whose distance from p along the geodesic is the length of X p (see 
Fig. VII.11). 


dt = [UX de = 1X ph 


Figure VIL11 
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We shall use the following lemma to obtain identities (6.5): 


(6.4) Lemma Assume that qe M and that X,€ T,(M) for which Exp X, is 
defined. Then Exp tX, is defined at least for each t with O<t<1 and 
q(t) = Exp tX, is the geodesic through q at t = 0 with (dq/dt)y) = X4. 


Proof Let q(t) be the unique geodesic with q(0) = q and (dq/dt)) = X, 
so that Exp X, = q(J). Given c,0 <c <1, consider the geodesic q(t) = q(ct). 
We have (0) = q and (d@/dt),-» = cX, which means that Exp cX, = 4(1) = 
q(c). Replacing c by ¢ in this equality gives the statement above. | 


We now revert once more to local coordinates U, p around q€ M and let 
V c U ande > 0 be as in Theorem 5.8 again so that for pe V and |X,| < e, 
Exp X, is defined. As in the proof ot Lemma 5.4, the geodesic determined by 
p, X, is given in local coordinates by 


t > (f (t, a; b), -.-, f"(t, a; b)) 


with ọ(p) = a = (a!, ..., a") and X, = b'E,, ++: + b"E,,. This means 
that 


o(Exp X,) = (f*(1, a; b), ..., f"(1, a; b)) 
and further that for |r| < 1 
p(Exp tX,) = (J (1, a; tb), ..., fC, a; tb)). 


However, the Lemma 6.4 and the meaning of the functions fi(t, a; b) then 
give us the following identities valid for |t| < 1: 


(6.5) POG ..., a"; ag) = f(t, a, ..., a"; b, ..., b"). 


From these remarks we can draw some conclusions concerning the ex- 
ponential map. First note that the f i are C® on their domain, hence 
X, > Exp X, is C” on {X,| pe V, |X,]| < £}. Second, we may compute the 
Jacobian of Exp, at X,=0,, the O vector at q, qE V. We denote by 
Exp, the restriction of Exp to T,(M) 0 2. Now q is fixed, (a',..., a") are 
constants, and the Jacobian matrix at this point has as entries ôf iJĝbi eva- 
luated at (1, a', ..., a”, 0,..., 0): 


fi o lp l 
= lim —(fi(1, a;0,..., h, ...,0) — f'(1, a; 0,..., 
ap T Hien UL a5 0, -h ,0) — fi(L, a; 0)) 


Using the identities (6.5), with b/ = 1 and b* = 0 for k # j, first with t = h, 
then with t = 0, this becomes 


fi n lou , 
a, = ae $ K eee see — f' ; t. 1 e. 
abi eed (h, a; 0, > 1, , 0) f (0, a, 0, ; 13 ,0)) 


= fi(0, a, ..., a"; 0, ..., tons O). 
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Since x' = f'(t,a’,...,a";0,...,1,...,0), i= 1,..., n (with b/ = 1 and 
bk = 0 if k + j), considered as functions of t, are the equations of the 
geodesic through q with E, as initial vector, we see that the Jacobian matrix 
reduces to the identity at X, = 0,, that is; df‘/éb’ = ôi. It follows that for q 
fixed and for some ¢’ < e the mapping X, > Exp X, is a diffeomorphism of 
the open set Ñ = {X,| ||X,|| < £} of T,(M) onto an open set N containing 
q = Exp 0,. Retaining the notation Exp, for Exp restricted to that part of its 
domain in T;(M), we summarize these results as follows. 


(6.6) Normal Neighborhood Theorem Every point q of a Riemannian mani- 
fold M has a neighborhood N which is the diffeomorphic image under Exp, ofa 
star-shaped neighborhood N of the zero vector 0, of the vector space T,(M). 


We have defined N by || X,|| < e'. Since the norm in T,(M) is given by the 
Riemannian metric, we may choose an orthonormal basis F o F, of 
T (M), and then, writing X, = )'"_, y'F,, we have |X, = Y?_, (y')?. With 
these choices, the mapping 


y: Ex ÈA) m (yt, ..., y") 
i=1 
takes the open neighborhood N = Exp, (Ñ) of q diffeomorphically onto 
B; (0) = R". 


(6.7) Definition The coordinate neighborhood N, y of q defined in this 
way is called a normal coordinate neighborhood. 


(6.8) Remark Normal coordinates have special features that make them 
useful in the study of the geometry of the manifold. Of these the most 
important are the following: 
(i) gi;(0) = Oy. ; : 
(ii) The equations of the geodesics through q take the form y'= a't, 
i= 1,..., n, a constants. 
(iii) The coefficients of the connection vanish at q: 


Tr4(0) = 0, i j, k = | ee 2 


The first and second statements are immediate consequences of the 
definition and Lemma 6.4. The third follows from the second since for all 
a’, ..., a" close to zero, substitution of the solutions y? = a't in the equations 
of the geodesics yields 


LTO‘ =0, k=1,....0, 
ij 
which implies (iii). 
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In fact the same computations used in the proof of the existence of 
normal neighborhoods give us a stronger result which we shall also find 
useful. We again let U,@ be a coordinate neighborhood of qe M, let 
E,,..., E, denote the coordinate frames, X, = ) b'E,, the tangent vectors 
to pe U, and o(p) = (x', ..., x”) the local coordinates. We have shown that 
there exists a relatively compact neighborhood V of q, V c U,andane > 0 
such that Exp X, is defined and in U for each X, with pe V and with 
|X ,,|| < £. Then in local coordinates 


e(Exp X,) = (f7 (1, xl, a., X"; bt, ca b”), 00, f EX pee ey PY) 


with f‘(t, x, b) being C® in all variables. We held p fixed at q to study the 
map Exp, from T,(M) to M. Now, however, we consider the mapping F of 
the open set @({X lp € V, ||X,|| < £}) = R" x R” to 


o(U) x e(U) c R" x R" 
which is defined by 
F: (x!,..., x"; b}, ee b") (xt, -.., x"; f (1, x, b), -.., f"(1, x, b)). 


This map corresponds to the map X, = » b' Ei > (p, Exp X p) with domain 
in T(M). We have already seen that of'/obi = 6; when bt = ~- = b" = Q. 
Therefore the Jacobian matrix of F is nonsingular at any point P E g 

.,0) of R” x {0} for which (x',..., x") = o(p) with pe V. Thus by the 
inverse function theorem for each 0, € T(M), 0, the zero vector at pe V, there 
is a neighborhood which is mapped diffeomorphically onto an open subset of 
U x UCM x M by this mapping, which takes the tangent vector X, at p to 
a pair of points of U, X,++(p, Exp X,). Now V was originally chosen as a 
relatively compact neighborhood of q lying in a coordinate neighborhood 
U, ọ. It was used to obtain an ¢ > 0 for which the open set {X,|p eV and 
|X ,,|| < e} of T(M) was in the domain 2 of Exp. This is also a set on which 
the mapping X, > (p, Exp X,) is given in local coordinates by F. From what 
we have just said we may restrict V and e further (without changing notation) 
so that the resulting neighborhood N(V, £) = {X,|pe V and |X,|| < £} of q, 
0, is mapped diffeomorphically onto an open set W c U x U. Although W is 
not of the form B x B, it does contain the diagonal set {(p, p)|pe V}. We now 
let B c V be a neighborhood of q such that B x B c W. Then B x B is the 
diffeomorphic image of some open subset of N(V, £) which can be described 
by N,={X,|peB, Exp X,¢B}. Putting these facts together gives the 
following result. 


(6.9) Theorem Let U, be a coordinate neighborhood of M and qe U. 
Then there exists a neighborhood B c U of q and an e > 0 such that any two 
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points p, p' of B can be joined by a unique geodesic of length less than £. This 
geodesic is of the form Exp tX,,0 < t < 1, and lies entirely in U. It follows 
that for each pe B, Exp, maps {X,| ||X,|| < ¢} diffeomorphically into an open 
set N, such that Bo N, c U. 


We remark that our choice of the neighborhood Ñ, does not allow us to 
conclude that whenever X,eN,, then Exp tX EB for all 0<t <1. 
Thus in general B does not necessarily have the property that p, p'e B are 
joined by a geodesic lying entirely in B. We have made our choices so that 
for each pe V, Exp, maps the « ball {X,| ||X,|| < £} into U diffeomor- 
phically and clearly has B in its image, thus each pe B has a normal neigh- 
borhood N, with B c N, < U. 

With somewhat more effort one can show that it is, in fact, possible to 
select a neighborhood B of each point q on a Riemannian manifold with the 
property that each pair of points p, p'e B may be joined by a unique (mini- 
mizing) geodesic segment lying entirely in B. Such neighborhoods are called 
geodesically convex and the proof of their existence is due to Whitehead [1]. 
It may be found in several of the references, for example, Helgason [1], 
Kobayashi and Nomizu [1], or Bishop and Crittenden |1]. 


Exercises 


1. A section of T(M) is a C” mapping F: M > T(M) such that zo F = 
identity on M. Prove that the sections of T(M) correspond precisely to 
C”-vector fields on M. 

2. Show that in a manner quite analogous to the definition of T(M), a 
manifold structure can be defined on 7'(M) for any fixed r and that 
covariant tensor fields of order r correspond exactly to sections of M 
into 7'(M). (A “section” is defined as in Exercise 1.) 

3. Show that the set of all unit tangent vectors at all points of M form a 
submanifold of T(M). Discuss the existence of sections with image in 
this submanifold. 

4. Let M be imbedded in R” as a submanifold and for each pEM let 
N, © T,(R™) be the subspace of vectors orthogonal to T,(M). Show that 
N(M) = |]pem N, can be given a structure of a C® manifold such that 
the natural mapping x: N(M) > M given by mapping N,(M) > pis C”. 
Proceed by analogy with T(M). 

5. Show that if G is a Lie group, then T(G) is diffeomorphic to G x R", 
n = dim G. 

6. Let F: M > N bea C”? mapping of manifolds. Show that F,: T(M) > 
T(N), defined by the usual mapping F,: T,(M) > T,(N), is a C® map- 
ping of manifolds and commutes with the projection mappings. 
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7 Some Further Properties of Geodesics 


Until now we have considered Exp only locally, restricted to a neighbor- 
hood of the zero vector at each point of the manifold. Our main purpose in 
this section is to prove, following Milnor [1], a theorem due to Hopf and 
Rinow [1] which gives conditions that the domain 2 of Exp be the entire 
tangent bundle T(M). Equivalently, this means that Exp X, is defined for 
every pe Mand X „€ T,(M). First, however, we wish to show that in all cases 
the domain 2 is an open set. 


(7.1) Theorem 2 is an open subset of T(M) and Exp: 2 > M is a Ce 
mapping. 

Proof We adopt the notation of the previous section and recall that to 
each coordinate neighborhood U, p of M corresponds a coordinate neigh- 
borhood U,@ of T(M). We have U =x '(U) and (0) = e(U) x 
R" c R" x R". In fact, if p(p) = (x',...,x") and E,,...,E, are the 
coordinate frames, 


PX p) = OO, yE) = (xt, 5 x"; y's) 


The natural mapping 2: T(M)—M is given in local coordinates by 
(n(X,)) = (x',..., x"); it is an open C® mapping and has rank n at every 
point. Suppose that p(t) is a geodesic on M. Then X p = dp/dt, its velocity 
vector, defines a curve t > X „p on T(M) with z(X x») = p(t). An examina- 
tion of the method by which we passed from the equations of geodesics (5.3) 
to first-order equations (5.5) reveals that on @(0) (denoted by W in 
Lemma 5.4) we considered the first-order system corresponding to the 


vector field 


Now we define a vector field Z on U < T(M) so that @,(Z) = Z’. If, as in 
Lemma 5.4, the solutions of (5.5) are given by x‘(t) = f‘(t, a, b) and y'(t) = 
dx'/dt, i = 1,..., n, then on U the integral curves (solutions) of the system of 
equations defined by Z are of the form 


dx! dx" 
~-1 1 n a 
0) ( (t), ..-5 x%(t)s r a) 


where p *(x*(t),...,x"(t)) = p(t) is a geodesic in U = x(U). In brief 
X „y = dp/dt is a solution curve of Z on x” ‘(U) € T(M) ifand only if p(t) is 
a geodesic on U. From its geometric meaning, or by a tedious computation 
for change of coordinates, we see that Z is a vector field defined intrinsically 
on all of T(M), independent of the particular expression in a coordinate 
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system, that is, the components (x',...,x”, — $; DAY YA., —V Thy’) 
transform as they should for a vector field when we pass to other coordinates 
so that Z is globally defined and depends only on the Riemannian connec- 
tion and metric. The geodesics on M are therefore exactly the projections by 
z: T(M)—M of the integral curves of Z. Thus the conclusion of the 
theorem follows from Theorem IV.4.5. [Note that IV (4.3)-(4.6) all hold!] 
i 


We have seen that geodesics on Riemannian manifolds generalize 
straight lines in R” in the following sense: Their unit tangent vector as we 
move along the curve is constant. But another basic property which charac- 
terizes straight lines in R” is the famous minimizing property of being the 
shortest curve joining any two of its points. We now examine in some detail 
the extent to which this property generalizes. A few examples will show that 
there are some difficulties. 

One of the more interesting is the right circular cylinder M with the 
Riemannian metric obtained by considering the plane R? with its usual 
metric as universal covering (see Exercise 5.1). Then the geodesics on the 
cylinder are exactly those curves which go into straight lines if we roll 
the cylinder along the plane: vertical generators and helices. Thus two points 
not on a circle whose plane is orthogonal to the axis will be joined by an 
infinite number of distinct geodesics of different lengths (Fig. VII.12). 

On S? the larger of the two arcs of a great circle which join two points p 
and q (which are not at opposite ends of at diameter) is not of minimal 
length, even among nearby circular arcs. Finally, for the plane with the 


R? 


(Lo) 


(a) (b) 
Figure VII.12 
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origin removed, the points (— 1, 0) and (+ 1, 0) cannot be joined by a shor- 
test curve at all. In view of all this it is remarkable that we are able to salvage 
something, in fact almost everything, if we limit ourselves to points close 
together and short geodesics. Let us recall that we have defined the length of 
a piecewise differentiable curve p(t) (of class D') over a < t <b by L= 
f? ||dp/dt|| dt; this is the Riemann integral of a piecewise continuous function. 
It is, by definition, equal to the sum of the integrals over the intervals of 
continuity [on each of which p(t) is of class C']. Then we may elaborate 
Theorem 6.9 as follows. 


(7.2) Theorem For each qe M, a Riemannian manifold, there exists a 
neighborhood B and an £ > Q such that each pair of points of B can be joined by 
a unique geodesic of length L < e. The length L’ of any piecewise C' curve 
joining these two points is > L. Moreover L = L if and only if these paths 
coincide as point sets, or equivalently, when parametrized by arclength, are 
identical. 


This theorem is established using Theorem 6.9 and the following two 
lemmas. According to Theorem 6.9, given qe M, there exists B and £ > 0 
such that each pair of points p, p' of B can be joined by a unique geodesic of 
length L < e. In fact, the equation p(t) of the geodesic is given by p(t) = 
Exp tX,, 0<t< 1, and |X,|| = L. The open set B lies in a coordinate 
neighborhood U, @ which contains this geodesic, and Exp, is a diffeomor- 
phism of the open ball of vectors X, of T,(M) of length ||.X,|| < £ onto an 
open set N,, of U containing B. This means that any sphere {X ,|||X || = r < £} 
maps diffeomorphically to a submanifold of U, denoted by S, (and called 
a geodesic sphere). The following lemma goes back to the work of Gauss. 


(7.3) Lemma Let pe B and suppose Exp, maps the open e-ball of T,(M) 
diffeomorphically onto N, > B. Then the geodesics through p are orthogonal 
to the geodesic spheres S, defined by S, = {Exp, X,|||X,|| =r} r < e. 


Proof Let X(t) be a curve in T,(M) with | X(t)|| = 1, a < t < b. Any 
geodesic from the point p may be written r > Exp, rX, 0 <r <e, with 
|X || = 1 and any curve on S, in the form t > Exp, rX(t). The mapping 
(r, t) > p(r, t) = Exp, rX(t) maps the rectangle [0, e] x [a, b] differentiably 
into M, and we will show that the inner product (ép/ér, ép/ct) = 0 for 
each ro, tg. These are the tangent vectors to p(r, to), the geodesic curve, 
and to p(ro, t) a curve on the geodesic sphere S, , respectively, intersecting at 
P(ro, to). If this inner product vanishes for every (ro, to), this will establish 
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the lemma. We first show that (ép/ér, ¢p/ét) is independent of r. By a basic 
property of differentiation, 


D (ép ép D ép @ép Cp Dép 

ar @ 2 E & ar’ 4 H (> ör r. 
Of these the (D/ér)(ép/ér) = 0 since holding t fixed and allowing r to vary 
gives a geodesic through q with @p/ér as its units tangent vecto. In the second 
term, if we interchange the order of differentiation (see Exercise 3.5), we 
obtain (ép/ér, (D/ét)(ép/ér)) = 4(D/ét)(ép/ér, ép/ér). Since ||@p/ar|| = 
|| X(2)|| = 1, we see that this is also zero and therefore (@p/ér, ôp/ôt) is 
independent of r. But p(0,t)=p, so ôp/ôt=0 at r=O and thus 
(ép/ér, Op/ét) =0 for all r. Hence for each (rọ,tọ) the inner product 
(6p/ér, ðp/ðt) = 0, which completes the proof. 


Now we consider a (piecewise) differentiable curve p(t), a < t < b, in 
N, — {p}; it has a unique expression of the form p(t) = Exp, r(t)X(t), where 
|| X(t)|| = 1. Using this notation, we state the following lemma. 


(7.4) Lemma f? ||dp/dt|\| dt > |r(b) — r(a)|. Equality holds if and only if r(t) 
is monotone and X(t) is constant. 


Proof Again we consider the map (r, t)—> p(r, t) = Exp, rX(t) from 
[0, €] x [a,b] > U. The curve p(t) connects the spherical shells S, of radius 
r = r(a) and r = r(b) in U,. We have p(t) = p(r(t), t) and, using r’ for dr/dt, 

ap 

dt 

Since ||@p/ér = || X(t)|| = 1 and (ép/ér, ép/ét) = 0 (Lemma 7.3), we have 

|dp/dt\|? = |r'(t)|? + lêp/ôtl? > |r|. Equality holds if and only if 
Op/ét = 0, that is, X(t) = constant. Hence, 


b w b 
| ap | r'(t) dt 


dt 
In the last inequality, we have equality only if r(t) is monotone; thus 
{2 \|dp/dt|| dt = |r(b) — r(a)| if and only if r(t) is monotone and X(t) = 
constant. This proves the lemma. 


OP ex OP 
=a OF oe 


b 
a> f ir'(t)| dt > = |r(b) — r(a)|. 


Proof of Theorem 7.2 We continue the notation of the lemmas. Suppose 
p(t), O < t < 1, is a piecewise smooth curve joining p = p(0) to p' = p(1) = 
Exp,rX,éN,,0 <r <eand ||X,]|| = 1. Let ô satisfy 0 < ô < r, and consider 
the segment of the curve joining the shell of radius 6 around p to that of 
radius r. According to Lemma 7.4, the length of this segment is > r — 6 with 
equality holding only if the curve coincides as a point set with segment of the 
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radial geodesic from p cut off by these shells, its length being r — ô. Thus the 
portion of the curve between these shells has strictly greater length than 
r — ô unless it coincides as a point set with a radial geodesic. Letting ô 
approach zero gives the result of the theorem. | 


It is probably quite obvious that the statement of Theorem 7.2 is bound 
up with the notion of distance on M, that is, the metric d(p, p’) which we 
considered in Section V.3. Recall that d(p, p’) is the infimum of the lengths of 
all piecewise differentiable curves from p to p' and that we showed that the 
metric topology and the usual topology coincided. The theorem just proved 
guarantees that for each point qe M there is ane > Oanda neighborhood B 
of diameter less than e (in terms of d) such that for every pair of points 
p, p' € B there is a unique geodesic segment from p to p’ whose length is the 
distance d(p, p’). More generally, we have the corollary which follows. 


(7.5) Corollary If a piecewise differentiable path (of class D') from p to qon 
M has length equal to d(p, q), then it is a geodesic when parametrized by 
arclength. 


Note that it follows that the path is C“! Of course the hypothesis and the 
definition of d(p, q) imply that the path has minimum length among all such 
curves. The proof is immediate: any segment of the path lying in a 
sufficiently small neighborhood (as above) must also have as length the 
distance between its endpoints (or it could be replaced by a shorter path) 
and thus it must be a geodesic. Since the curve is a geodesic locally, it is a 
geodesic. 


(7.6) Definition A geodesic segment whose length is the distance between 
its endpoints is called a minimal (or minimizing) geodesic. 


Unlike the local situation in Theorem 7.2, we have seen that on an 
arbitrary manifold there may be points p, q which are not connected by a 
geodesic at all, for example, R? with the origin removed. Moreover, even if 
there exist such minimal geodesics joining p, q as there do on the sphere, 
they need not be unique—for example, there are an infinite number of 
minimal geodesics joining the north and south poles. The question of uni- 
queness is not simple and we will not go into it here. For details, as well as 
many additional theorems on geodesics, the reader should consult 
Milnor [1]. 

However, the existence question as well as some other questions we have 
raised are answered in a beautiful theorem of Hopf and Rinow [1]. Before 
stating this theorem we remember that each geodesic and geodesic segment 
is contained in a maximal geodesic, that is, a geodesic p(t) such that p(t) is 
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defined for a < t < b and not for any larger interval of values. If a = — œ 
and b = +00, we say that the geodesic can be extended indefinitely. This is 
always true of a closed geodesic (a geodesic which is the image of a circle, for 
example, a great circle on S°). If every geodesic from pe M can be extended 
indefinitely, then the domain 2 of Exp contains all of T,(M) and conversely. 


(7.7) Theorem (Hopf and Rinow) Let M be a connected Riemannian 
manifold. Then the following two properties are equivalent: 


(i) Any geodesic segment can be extended indefinitely. 
(ii) With the metric d(p, q), M is a complete metric space. 


The proof will be based on a lemma. Assume any geodesic segment 
t > p(t), a < t < b, can be extended to a maximal geodesic curve t > p(t), 
defined for —æ <t < +o. In order to see that M is complete (every 
Cauchy sequence converges), it is sufficient to show that every closed and 
bounded set is compact; and to prove this we need the following lemma, 
which is of interest in itself. The proof is modeled on that of Milnor [1]. 


(7.8) Lemma IfM has the property that every geodesic from some point 
pé M can be extended indefinitely, then any point q of M can be joined to p by 
a minimal geodesic [whose length is necessarily d(p, q)}- 


Proof Let q bean arbitrary point of M and let a = d(p, q). Any geodesic 
from p may be written p(s) = Exp sX „with X „a unit tangent vector at p and 
s arclength measured from p = p(0). We must show that for some X, with 
|X, = 1, p(a) = Exp aX, = q, so that s> Exp sX, 0 < s <a, is the mini- 
mal geodesic segment. We will use the following fact, which is also of some 
interest. 


(7.9) Suppose that po, Pi, -.-, Pa are points of M and that 
(*) dpo , pı) + (Pr, P2) +: + d(Pn- 1, Pn) = (Po » Pa). 


If a piecewise differentiable curve contains p;, Pi+1» ---, Piz, and has length 
equal to d(p;, Pi+1) +7 + d(p;4,—1, Pi+,), then it is a geodesic segment from 
Pi to p;,,. Conversely, if po, ..., p, lie on a minimal geodesic segment, in that 
order, then (*) holds for them. 


It is easily seen that it is enough to verify this for r = 2. The curve C from 
Pi to pi+1 to p;,2 has length L = d(p;, p;.1) + d(pi+1, Pi+2)- By the triangle 
inequality L > d(p,, p;+2). If equality holds, C is a (minimizing) geodesic 
segment from p; to p;+2 as required (Corollary 7.5). But this must be the 
case; otherwise we have 


d(Pi , Pi+1) + A(Pi +1. Pi+2) > AP; . Pi+2) 
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by the triangle inequality and then substituting in (+) we have 


d(po , Pi) + °° + A(D;, Pig 2) + °° + d(Pn-1, Pn) < (Po » Pr) 


which contradicts the triangle inequality. Finally, the last statement follows 
immediately from the fact that any subsegment of a minimal geodesic seg- 
ment is also minimal. 

To return to the proof of Lemma 7.8, using Theorem 6.9 we suppose 
ô > 0 to be chosen so that S; = {p' | d(p, p’) = ô} is a geodesic sphere in 
some normal neighborhood of p, sufficiently small to ensure that each radial 
geodesic from p to S; is minimal. Then since $; is compact, there is a po € Ss 
satisfying 

d(po .q) = inf d(p’, q). 


yess 


Let X, be the unit vector at p such that pọ = Exp 6X,. We must have 


d(p, Po) + d(po » q) = d(p, q), 


otherwise there is a piecewise differentiable curve joining p to q whose length 
is less than d(p, po) + d(po, q) = 6 + d(po, q). Since it must intersect S; at 
some point p’ and its length from p to p’ can be no less than ô, we have 
d(p', q) < d(po,q) contrary to our choice of py. We now consider all s’, 
0 < s'< a, such that the geodesic segment s Exp sX,,0<s<s’, is mini- 
mizing and such that 


d(p, Exp s'X ,) + d(Exp s'X p. q) = d(p, 4). 


By the continuity of the conditions the collection ofall such s’ forms a closed 
interval 0 < s' < b. If b = a, then Exp aX, = q, which proves the lemma. 
Suppose b < a; let p, = Exp bX,, then d(p, pı) + d(p1,q) = d(p, q) and we 
may obtain a contradiction by repeating the arguments above as follows. Let 
S„. n > 0, be a small geodesic sphere (with radial geodesics minimizing) ina 
normal neighborhood of p, = Exp bX, and choose a point p, on S, such 
that 
d(p2,q) = inf d(p", q). 


D" € Sy 


Then, as before d(p;, p2) + d(p2, q) = d(pı, q) and therefore 


d(p, pı) + d(pı, P2) + d(p2. q) = d(p, pı) + (Pi, 4) = d(p, q). 


By (7.9) the geodesic p(s) = Exp sX, from p to p, together with the (radial) 
geodesic in S, from p, to pz is a single (minimizing) geodesic segment from p 
to p, of length d(p, p2) > b, which contradicts the definition of b. Therefore 
b = a and the lemma follows. i 
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Proof of Theorem 7.7 We now turn to the proof of the theorem. We 
show first that (i) implies (ii). Let K be a closed and bounded subset of M. 
We will show that K is compact. Suppose pe K and a = sup,. x d(p, q); a is 
finite since K is bounded. By Lemma 7.8, for any q€ K there is a minimizing 
geodesic from p to q; its length is d(p, q) which must be no greater than a. It 
follows that K c Exp, Ba, where B, = {¥,|||¥,|| < a}, the closed ball of 
radius a in T,. Since B, is compact and Exp is continuous, Exp, B, is 
compact. K is a closed subset of Exp, B,, so it must be compact. This 
completes the proof that M is a complete metric space according to 
Exercise 6. 


(7.10) Remark Any manifold M having property (i) has the property that 
the domain 2 of the exponential function is all of T(M), that is, that the 
vector field Z of Theorem 7.1 is complete. Actually, in proving that (i) im- 
plies (ii), we used only the weaker hypothesis of the lemma: every geodesic 
from some point pe M can be extended indefinitely, that is, Z > T,(M) for 
some pe M. It was not necessary to assume pe K, for if K is bounded, then 
for any pe M the distances d(p, q) are bounded for all ge K. 


Next we show that (ii) implies (i), that is, we suppose that every Cauchy 
sequence on M converges and show that this implies the extendability of 
geodesics. Suppose to the contrary that there is a geodesic ray, p(t), 
0 < t < tọ which cannot be extended to t = tọ; we may assume, changing 
parameter if necessary, that t is arclength. Let {t„} be an increasing sequence 
of parameter values with lim, t, = to. Denoting by p, the points p(t,), we 
have d(p,, Pm) < |t, — tm | since the right-hand side is the length of a curve 
(the geodesic) from p, to p,,. Thus {p,} is a Cauchy sequence and we denote 
its limit by q, q = lim,..., Pa = lim,,.,, p(t,). Now we let B be a neighbor- 
hood of q, and £ > 0 be so chosen that each pair of points p, p’ of B are 
joined by a unique geodesic of length less than e. Of course this geodesic is 
minimizing, or equivalently its length is d(p, p'). We let N be an integer 
which is large enough so that for n,m > N we have d(p,, Pm) < and 
d(p,, q) < £ and Pn, PrE B. 

Consider n > N fixed and suppose m > n; then we have 


d(Pn > Pm) + dlPm +4) = (tm — tn) + d(Pm» q). 


Since tm — t, is the length of our geodesic from p, to p,, and is less than z, this 
segment of the geodesic is minimal. Now let m > œ and by continuity we 
have d(p,,, q) = to — t, for n > N. Applying this to m > n, we have for all 
m>n>N, 


d(Pn > Pm) + A(Pm > 4) = ba ta + to — bn = to — ta, = d(p, » q). 
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Now choosing a fixed m > n, we see that the unique geodesic segment from 
Pn tO Pm of length d(p,, Pm) together with the unique geodesic segment from 
p, to q of length d(p,, q) has length equal to the distance d(p, , q) and there- 
fore is a single (unbroken) geodesic from p, to q. However, it coincides with 
the given geodesic p(t) for t, < t < tm, that is, from p, to Pm; thus it is an 
extension of this to a geodesic segment com p to q. This shows that p(t) can 
be extended to t = fp. 

We note that it is immediate that a geodesic segment p(t), 0 < t < tọ, 
can be extended beyond its endpoints; this follows at once from the fun- 
damental existence theorems. Thus any geodesic on a complete manifold can 
be extended indefinitely, Exp, is defined on all of T,(M) for every p, and Exp 
has the entire tangent bundle T(M) as its domain, that is, 2 = T(M). ff 


The following corollary depends on the fact that a compact metric space 
is complete. 


(7.11) Corollary If a connected Riemannian manifold M is compact, then 
any pair of points p, q€ M may be joined by a geodesic whose length is d(p, q). 


(7.12) Corollary Let F,,F,:M—M be isometries of a complete, con- 
nected Riemannian manifold. Suppose that F ,(p) = F2(p) and F,, = F2, on 
T,(M) for some pe M. Then F, = F3. 


Proof Let qe M and let p(s), O < s < l, be a geodesic from p to q, 
p = p(0) and q = p(l). Then F;(p(s)) is a geodesic from F(p) to F(q), 
i= 1,2. Since F,(p) = F(p) and Fy, (p(0)) = Fr, (p(0)),these geodesics 
coincide and 


F,(q)= F (p(!)) = F (p(0)) = F,(q). I 


Exercises 


1. Let M be a complete Riemannian manifold and let q€ M. Identify T,(M) 
with R", n = dim M, as a manifold by choosing an orthonormal basis at 
q. Then Exp,: T,(M) > M is a C” mapping of R” onto M with 0 map- 
ping to q. Suppose M = S”, the unit sphere with the usual metric. Prove 
that rank Exp, < n for X, if ||X,| = kr, k = +1, +2,.... 

2. Show that on a Riemannian manifold M which has R” as a Riemannian 
covering (z is a local isometry), the rank of Exp, is n for all ge M. 

3. Let M be a complete Riemannian manifold and z: M > M a covering, 
Show that there is a unique Riemannian metric on M such that 7 is a 
local isometry and show that with this metric M is complete. 

4. Give a simple example of a Riemannian manifold diffeomorphic to R" 
but such that no geodesic can be extended indefinitely. 
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5. Show by example that there is a Riemannian manifold on which dist- 
ance between points is bounded, that is, d(p, q4) < a, a > 0 fixed, but on 
which there is a geodesic of infinite length which does not intersect itself. 

6. Show that if M is a metric space in which every bounded set is relatively 
compact (has compact closure), then M is complete. 


8 Symmetric Riemannian Manifolds 


(8.1) Definition A connected Riemannian manifold M is said to be sym- 
metric if to each pe M there is'associated an isometry cp: M > M which is 
(i) involutive (ø? is the identity), and (ii) has p as an isolated fixed point, that 
is, there is a neighborhood U of p in which p is the only fixed point of Op: 


As examples we cite Euclidean n-space, in which case ¢, is reflection in p; 
and S”, the unit sphere in R"*!, with the metric induced by R"*? In the case 
of the sphere, c, is again reflection i in p—for each q, o,(q) = q', where q and 
q' are equidistant from p on a geodesic (great circle) through p (Fig. VII.13). 


Oye (Xp) = -Xp 


Figure VII.13 


In the case of S” we note that o,(p) = p and o,(p*) = p*, p* denoting the 
point antipodal to p. Thus, in general, o, may have other fixed points than p. 
Note also that the first example is a noncompact manifold and that the 
second is compact. A symmetric space, as we shall see, is always complete. 


(8.2) Lemma If pe M, a Riemannian manifold, and G, is an involutive 
isometry with p as isolated fixed point, then o,,(X,)= —X, and 
o (Exp X,) = Exp(— X „) for all X ,€ TM) for which both sides are defined. 
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Proof Since a? is the identity, the same holds for (c,,)° on 7,(M). This 
means that the eigenvalues of o,, on T (M) are +1. However, if +1 is an 
eigenvalue, then there exists a vector X, # 0 such that o,,(X,) = X,. For 
any isometry F: M > M, F - Exp = Exp = F, since geodesics are preserved. 
This means that o,(Exp tX) = Exp tX so the geodesic through p with initial 
direction X „ is pointwise fixed. This means that p is not an isolated fixed 
point of o,. Thus +1 is not an eigenvalue and o,, = —I, I being the 
identity. Since o is an isometry, o,(Exp X,) = Exp o,,(X p) = Exp(—X>,). 
This means that ø, takes each geodesic through p onto itself with direction 
reversed, exactly as in the two examples we have cited. | 


The following corollary is an immediate consequence of Corollary 7.12 
and the lemma: 


(8.3) Corollary Given any complete Riemannian manifold M and point 
pe M, there can be at most one involutive isometry co, with p as isolated fixed 
point. 


(8.4) Theorem A symmetric Riemannian manifold M is necessarily 
complete, and if p, q€ M, then there is an isometry o,—corresponding to some 
re M—such that o,(p) = q. 


Proof First we show that M is complete by proving that every geodesic 
can be extended to infinite length. Suppose p(s) is a geodesic ray with s as 
arclength, which is defined for 0 < s < b. We will show that it can be ex- 
tended to a length | > b. Let so = žb, and let cso) be the symmetry in p(so). 
It takes the geodesic p(s) to another geodesic through p(s) whose tangent 
vector at p(so) is —(dp/ds),, and whose length is the same as that of p(s). 
Since it has a common tangent with p(s) at p(so), it coincides with p(s) on the 
interval 4b < s < b and thus extends it to a length > 3b, which proves the 
statement (Fig. VII.14). 

Using this it follows easily that given any p, q€ M there is an isometry of 
M taking p to q. In fact, let r be the midpoint of a geodesic from p to q. Then 


the isometry c, takes this geodesic onto itself and carries p to q. | 
P{So) 7 
ol (0) 
Op(r) 
p10) 
Figure VIE14 
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We remark here that it is easy to verify that the isometries of a Rieman- 
nian manifold M form a group I(M); it is a subgroup of the group of all 
diffeomorphisms of M. A classical theorem due to Myers and Steenrod [1] 
asserts that it is a Lie group and acts differentiably on M. According to the 
theorem just proved it is also transitive when M is a symmetric space. 

Before developing further the properties of symmetric spaces we prove a 
theorem which gives a rich collection of examples. 


(8.5) Theorem Every compact connected Lie group G is a symmetric space 
with respect to the bi-invariant metric. 


Proof Let Y: G > G denote the diffeomorphism which takes each ele- 
ment to its inverse, y(x) = x~'. This map is clearly involutive and in fact it is 
an isometry of G with e, the identity, as isolated fixed point. To establish this 
we recall that to each X, e€ T,(G) corresponds a uniquely determined one- 
parameter subgroup t> g(t) with g(0) = X, (Section IV.6). Since y(g(t)) = 
g(—t), by the chain rule we obtain 


Hale) = va (@10)) = (FUD) _ = -20 = =x.. 


This means that Ye = —1, which is an orthogonal linear transformation (or 
isometry) of any inner product on T,(G). Let ae G be arbitrary and denote 
left and right translations by any ge G by L, and R,, respectively. We may 
write 


W(x) = x7! = (ax) la! = R,-1 © Y ° L,-1(x). 
Hence Wya: T(G) > T,-1(G) may be written 
Waa = (R a- tele ° Vye ° (L a- ide 


which is a composition of three linear mappings each of which is an isometry 
of the inner product determined by the bi-invariant metric (R,-, and L,-: 
induce isometries on every tangent space and w,, is an isometry as shown 
above). It follows that y: G > G is an isometry. If we consider a normal 
neighborhood of e (as in Definition 6.7 with q = e), then by Lemma 8.2 y 
is given in local coordinates by reflection in the origin, and hence e is an 
isolated fixed point. 

Now let ge G. We define the isometry o,:G > G which has g as an 
isolated fixed point by c, = L; ° R,o Y, that is, o,(x) = gx~*g. It is an 
isometry since R,, L,, and w are isometries, and it is easy to check that it is 
involutive and has g as isolated fixed point. i 


(8.6) Example Let G = SO(n) be the group ofn x n orthogonal matrices 
of determinant + 1. According to Example IV.6.7, the tangent space T,(G), 
e= I, then x n identity matrix, may be identified with the skew symmetric 
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matrices A = (a;,) = — A’ in the sense that X, = )/,, ; a,,(6/Cx,) is tangent at 
I to SO(n) considered as a submanifold of Gl(n, R) < R”. The one- 
parameter subgroups are of the form Z(t) = e'4. Identify g with 7,(G) and 
compute Ad B: T,(G) > T,(G) as follows. First one verifies from the definition 
of e'4 that 


Bet4B7} = ei BAB} 


(compare Exercise IV.6.6). Since Ad(B) acting on T,(G) is just the linear map 
of the tangent space induced by the mapping Z > BZB™' on SO(n), we see 
that Ad(B) takes the component matrix 4 = (a;;) of X, to BAB~'. Now 
define on T,(G) an inner product (X,, Y,) for x, = ¥a,,(6/éx;,,)), 


Y= 5 c;j(0/2x;;) b 


It is clearly bilinear and symmetric; moreover, since 
' 2 
(Xe, Xe) = tr A'A = } ajaj = } ai, 
ij 


it is positive definite. Finally for Be SO(n) 


(Ad(B)X, , Ad(B)Y,) = tr((BAB™')BCB™*) 


tr(BA'CB™') = tr A'C = (X,, Y). 


This means that this inner product determines a bi-invariant Riemannian 
metric on G (Lemma VI.3.4). By Theorem 8.5, G is a symmetric space with 
this Riemannian metric. A similar procedure may be employed to obtain the 
bi-invariant Riemannian metric for other compact matrix groups. 


We now develop the general properties of symmetric spaces somewhat 
further. Let M be any symmetric Riemannian manifold and p(t), 
— œ < t < œ, be any geodesic on M. The symmetry opa associated with 
any point of this geodesic maps the geodesic onto itself and reverses its sense. 
If c is a fixed real number, then we denote by z, the following composition of 
two such isometries, T. = 6 pe) ° Fpc/2)- Since T, maps the geodesic onto itself 
and preserves its sense, its restriction to the geodesic must be of the form 
t.(p(t)) = p(t + constant). In fact, since t,(p(0)) = Cpo) ° € pe (PO) = 
6p) P(c) = p(c), we see that the constant is c and t,(p(t)) = p(t + ¢). 

Now we consider how 7, acts on the tangent space at a point of p(t). 
Suppose in fact that X „o)€ T,o)(M), and define a vector field X „n along p(t) 
by the formula X pe) = Tiy X po) - Let Xj, be the unique vector field satisfy- 
ing X o) = Xpo) Which is constant along the geodesic p(t). We wish to show 
that these two vector fields coincide. Now for any real number to, ¢ stoj X po 
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is a parallel vector field along p(t) since o pro is an isometry. On the other 
hand, 6 p05) X pio) = —X pao) Since p(to) is the fixed point of the symmetry. 
Because — X; is also a constant vector field along p(t) and agrees with the 
field 6 prox X pe) at one point, it must agree everywhere. Applying this argu- 
ment twice we see that te, Xj) = Xpe+o for all t and each constant c. 
Letting t = 0 and c = t proves our assertion. We have proved the following 
theorem. 


(8.7) Theorem Let p(t), —2c < t < œ, be a geodesic of a symmetric mani- 
fold M and t, the associated isometry (defined above) for each real number c. 
Then 1,(p(t)) = p(t +c) If Xpo is any element of Tyo)(M), then 


X po = Tre X po) IS the associated parallel (constant) vector field along p(t), 
that is, as t varies t,4: T,9)(M) > T,(M) is the parallel translation along the 
geodesic. 


(8.8) Remark Note that if p, = p(c,) and p, = p(c2) are any two points 
of a geodesic p(t), —2<t<oc, then by the same argument 
pr © Fp, (P(t)) = p(t + 2(c. — c,)) and (6p, ° ,,), Maps any parallel vector 
field along p(t) to a parallel vector field. 


Theorem 8.7 will be used to prove a fact about compact Lie groups 
which is not at all obvious; it is given as a corollary to the following theor- 
em. [It is because of this theorem that the notation Exp tX is used for 
geodesics in Riemannian manifolds.] 


(8.9) Theorem Let M = G, a compact, connected Lie group with the bi- 
invariant metric and let X,€T,(G). Then the unique geodesic p(t) with 
p(0) = e and p(0) = X, is precisely the one-parameter subgroup determined by 
X.. All other geodesics are left (or right) cosets of these one-parameter 
subgroups. 


Proof Given a geodesic p(t) with p(0) = e, we consider the isometry 
F 1s) F co) Of G. By the remark above we see that this maps the geodesic onto 
itself with p(t) being mapped to p(t + 2s). But using our formula for ¢,0nG 
together with p(0) = e, we have 

F p(s) Fp0) P(t) = p(s)p(t)p(s). 
[The right-hand side is the group product of p(s), p(t), and p(s).] Thus for all 
t, S, 
p(s P(Ðp(s) = p(t + 2s). 

Using various t and mathematical induction, this gives for arbitrary s and 
any integer n 


(p(s))" = p(ns). 
PDF pdfFactory Pro www.fineprint.cn 


356 VIL DIFFERENTIATION ON RIEMANNIAN MANIFOLDS 


In particular, if a, b, c, d are integers with bd # 0, we have 


a c B 1 ANERE 1 ad 1 be a c 
Plot a} Nba ~ Plea) Nba) ~ Plo) Play 
Thus for any rational numbers we have 


pir, + r2) = P(r:)p(r2)- 
Since p(t) depends continuously on t we see that this holds for all real 
numbers, and thus any geodesic with p(0) = e is a one-parameter subgroup. 
However, since there is exactly one geodesic and one such subgroup with 
given p(0) = X,, we see that the first sentence of the theorem is true. The 
second follows at once if we use the fact that either left or right translations 
are isometries, and hence preserve geodesics, together with the fact that a 
geodesic through any ge G is uniquely determined (with its parametriza- 
tion) by its tangent vector at g. i 


(8.10) Corollary If G is a compact, connected Lie group, then any g €G lies 
on a one-parameter subgroup. 


Proof With the bi-invariant Riemannian metric G is a symmetric 
Riemannian manifold. Moreover it is complete and hence any pair of points 
can be joined by a geodesic. If g € G, then the geodesic segment from e to g is 
on a one-parameter subgroup according to the theorem. i 


(8.11) Example If G = SO(n), then the geodesics, relative to the bi- 
invariant metric of Example 8.6 are the curves p(t) = e'4 (A any skew sym- 
metric matrix) and their cosets. 


In the case of a group G with bi-invariant metric we can now establish 
a relation between the Lie derivative and the Riemannian differentiation V 
of vector fields, which we shall need in the next chapter. 


(8.12) Theorem If X and Y are left-invariant vector fields on G and V is as 
above, then we have 


Vx Y = 4X, Y] =4L,Y. 


Proof Suppose that Z is any left-invariant vector field. Then we will 
compute Vz, Z. If g(t) is the uniquely determined one-parameter group with 
g(0)=e and g(0)=Z,, then for any vector field Y we have 
Vz. Y = (DY, p/dt),=9. However, Z,, = dg/dt, and g(t) is a geodesic. Thus 


DZ,w/dt = (D/dt)(dg/dt) = 0 and Vz,Z = 0. Since Z and the metric are 
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left-invariant, it follows that V;z;Z=0 everywhere on G. Thus 
Vx+y(X + Y) = 0, from which we conclude that 


VY + VyX =0. 


On the other hand, we know that for any pair of vector fields a Riemannian 
connection satisfies the identity Vy Y — Vy X =[X, Y]. Combining these 
two identities gives the lemma. 


Exercises 


1. Let T: V > V bea linear transformation on a vector space over R which 
is involutive, that is, T? is the identity. Show that there is a basis of V 
such that the matrix of T is diagonal with diagonal elements equal to 
+1. 

2. The unitary group U(n) consists of all complex matrices A which satisfy 
the relation AA* = I, A* "= ‘4, the transpose conjugate. Show that U(n) 
can be considered as a compact subgroup of Gi(2n, R) and determine a 
bi-invariant Riemannian metric by giving it explicitly on T,(U(n)) as in 
Example 8.6. 

3. Let G be a compact Lie group with a bi-invariant Riemannian metric 
and let X., Y., Ze be vectors at the identity. Compute R(X, Y): Z by 
extending them to left-invariant vector fields on G and using 
Theorem 8.12. 


9 Some Examples 


Except for Euclidean space itself, the examples we have given of symme- 
tric spaces have been compact manifolds. We will consider a further exam- 
ple, which is not compact. To do so we must begin, in a rudimentary way at 
least, to develop some additional theory which will show the path toward 
further examples—in fact toward all examples of symmetric spaces. 

As we have noted, symmetric spaces are acted upon transitively by their 
group of isometries. It is natural, therefore, to ask under what circumstances 
can one be sure that a manifold M, acted on transitively by a Lie group G, 
can be endowed with a Riemannian metric relative to which the transforma- 
tions of M by elements of G are isometries. A sufficient condition is given by 
the following theorem. 


(9.1) Theorem Let G be a Lie group acting transitively on a manifold M. 
Then M has a Riemannian metric such that the transformation determined by 
each element of G is an isometry if the isotropy group H of a point pe M is a 
connected compact (Lie) subgroup of G. 
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Proof We let 6:G x M >M denote the action, and for each ge G, 
6,:M —>M denotes the diffeomorphic transformation of M onto itself 
determined by g, 8,(q) = 0(g, q). If g € H, then 6,(p) = p so that it induces a 
linear mapping 6,,: 7,(M)— T,(M). Since 6,,°6,, = Ogg, we have 
Box ° 992% = Ozga SO that g > 6,, is a homomorphism of H into the group 
of linear transformations on T,(M). From the fact that 6 is C% it is easily 
verified that this is a C” homomorphism, that is, a representation of H on 
T,(M). Referring to Section VI.3 and, in particular, Theorem VI.3.9, we see 
that since H is compact and connected, there must be an invariant inner 
product, which we shall denote by ®,(X,,, Y,) on T,(M). Now if qe M, there 
is a ge G such that 0,(q) = p. We define ®,(X,, Y,) by 


D(X, > Y,) = OFD(X, y) = 9,(0,,Xq > 85x Yq): 
If 6,,(q) = p also, then gg, * € H. Hence 6%,,-.0, = ®, and 
0P, = 05,95. 1-19, = OF, z Ces 2 039, = 039, . 


91°99 
It follows that ®, is well defined; it is positive definite since 0, is a 
diffeomorphism; and it is easily verified that ® is C” and G-invariant on M. 
Thus ® defines a Riemannian metric on M with respect to which each 8, is 
an isometry of M. This completes the proof. I 


In the following theorem we will continue this notation, and suppose as 
above that H is compact and connected and moreover that the action of G 
on M is effective. Then we are able to impose an additional condition which 
will be sufficient to ensure that M, with a metric which makes G a group of 
isometries, is a Symmetric space. This will open the way to further examples, 
of which we give only one in detail. 


(9.2) Theorem With G, H, p and M as above suppose that «: G > G is an 
involutive automorphism of G whose fixed set is H. Then the correspondence 
&(6(g, p)) = O(x(g), p) defines an involutive isometry of M onto M with p as an 
isolated fixed point. 


Proof First we check that & actually defines a mapping of M onto itself. 
Let q be an arbitrary point of M. By transitivity there is at least one g € G 
such that 6(g, p) = q. If g' is a second such element, then g’ = gh and a(g') = 
a(g)a(h) = a(g)h. Hence 


à(Olg', p)) = O(a(g’), p) = O(a(g)h, p) = O(a(g), O(h, p)) = O(a(9), p) 


as required. Therefore & is defined independently of any choices. Since &? is 
the identity, & is onto. Let us assume for the moment that we have proved 
that & is C”, has p as an isolated fixed point, and that %,: T,(M) > T,(M) is 
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—I, that is, @,(X,) = —X,,. Then, clearly, &, preserves the inner product ®, 
on T,(M). IfqeM, q +p, "then choose ge G such that 6,(p) = q. Then 


%(q) = (alg), p) = z (9g-1(q))- 


Hence %,,: T,(M) > Tyg(M) is given by &,, = Oae ° 94-14, both of which 
are isometries on the tangent spaces. Thus, subject to checking the other 
properties, & is an isometry. 

In order to verify the remaining properties we need to use the fact that 
the natural identification of M with G/H given by the mapping F:G/H 
> M, F(gH) = 6(g, p), is C” and commutes with left translation on G/H. 
Thus we use Section IV.9, which was an application of Frobenius’ Theorem 
[although, in fact, in the examples given below the facts we need here can be 
checked directly without relying on this general procedure]. First we recall 
that if gH € G/H, then there is a C® section S defined on a neighborhood V 
of gH, S: V > G with mo S = id (x: G > G/H is the natural projection and 
id the identity on V). Using the diffeomorphism F, obtain a C” section 
§= S F`! on = F(V ) into G which means a C® mapping such that 
a(S(q) , p) = q for all q € V. Every point of M is contained in the domain V of 
such a section, and &| V is given by 


a(q) = &(0(S(q), p)) = 9(2(S(4)), p} 


which is a composition of C” mappings. It follows that & is C®. 

Finally we wish to show that & has p as an isolated fixed point and that 
&yp = —I. We use facts demonstrated in Section IV.6 concerning the expon- 
ential mapping (Definition IV.6.8) 


exp: T,(G) > G 


(not to be confused with Exp, the exponential mapping of Riemannian 
manifolds). Given any X, e€ T.(G), then exp tX, = g(t) is the one-parameter 
subgroup of G with g(0) = X.; and exp X, = g(1). By Theorem IV.6.10, 
there is an e > 0 such that an e-ball B%(0) = T,(M) is mapped diffeomor- 
phically onto a neighborhood U of e, the identity of G. Since «: G > G is a 
Lie group automorphism with g? the identity, «,: T.(G) > T,(G) splits T.(G) 
into the direct sum of two subspaces V* of characteristic vectors belonging 
to the characteristic values +1 of a,. Since a(exp tX.) = exp ta,(X,), 
a,(X,)=X, if and only if X,eT7,(H). Thus T,(G)=V*t*@V-, 
V* =T(H). 1:G—>G/H defines n}: T,(G) > Ty.(G/H) with ker z, = 
TH) and 2, |V~ an isomorphism onto. It follows that z» exp maps a 
neighborhood W of V~ ^ B?(0) = T,(G) diffeomorphically onto a neighbor- 
hood of H in G/H. Composing with F: G/H > M gives a diffeomorphism 
onto an open set around p. Thus for X, € W, the mapping X, > O(exp X., p) 
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is a diffeomorphism. Moreover 4&(6(exp X., p)) = O(a(exp X,), p) = 
O(exp(— Xe), p). It follows that p is the only fixed point of & in this neighbor- 
hood and that &,: T,(M) > T,(M) is —I; each vector is taken to its negative. 
Combining this with what we have already established, the proof is 
complete. | 


The following corollary is immediate, since each 0,: M >M is an 
isometry. 


(9.3) Corollary Under the assumptions of the theorem M is a symmetric 
space with involutive isometries o, = % and o, = 0,° Qo 6,-1 for q = O(g, p). 


Much of the above is to enable us to consider somewhat more com- 
plicated examples of Riemannian manifolds, of which the following is a 
sample. 


(9.4) Example Let M be the collection of all n x n, symmetric, positive 
definite, real matrices of determinant + 1, and let G = Si(n, R) be then x n 
matrices of determinant +1. Then G acts on M as follows: 


Olg, s) = gsg', 
where g' denotes the transpose of g €,Sl(n, R). We will let p, the base point of 


the theorems above be I, the n x n identity. We then note that H = SO(n) 
since 


H = {ge SIn, R)| (9,1) = 1) 


is given by the equivalent condition gg’ = I, that is, g € SO(n), the group of 
orthogonal n x n matrices. Hence M is canonically identified with 
Sl(n, R)/SO(n). 

The automorphism « which we consider is defined by a(g) = (g~ +y, the 
transpose of the inverse of ge Si(n, R). Note that a(g) = g if and only if 
g € SO(n). Thus all of the conditions of the theorem are met if Sl(n, R) is 
transitive on M. However, any positive definite, symmetric matrix q may be 
written in the form q = gg’ = gIg' where g € Sl(n, R) by standard theorems 
of linear algebra. From the corollary above M is a symmetric space relative 
to an Si(n, R) invariant metric. [Note that a: M > M can be seen, quite 
directly, to be C” and to have the identity p = I as its only fixed point on M. 
In fact, using q = sIs’, we see that 


à(q) = à8(s, I) = Os Aas et ary tag 


Thus č: M — M simply takes each positive definite symmetric matrix to its 
inverse. The only such matrix which is equal to its own inverse is the 
identity /.] 
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(9.5) Example A variant on the above, which is a particularly important 
case, is the following. Let M = {(x, y)e R*|y > 0}, the upper half-plane. 
[Note that it is covered by a single coordinate neighborhood.] We define an 
action of SI(2, R) on M as follows. We identify R? with C, the complex 


numbers, in the usual way. Let z = x + iy and let w = u + iv, i = ./—1. 
When g e€ SI(2, R), that is, 


a=(¢ a ad— bc = +1, 
c d 


we then define w = O(g, z) = (az + b)/(cz + d). It is not difficult to verify 
directly that if y = Im(z) > 0, then v = Im(w) > 0 and that 0(g,, 0(g>, z)) = 
(9192, 2). Moreover the Riemannian metric defined (in the local coordinates 
(x, y)—or z = x + iy—which cover M) by the matrix of components 


I l 


y? _ | me) 
| 1 
0S rE TEENY) 
Y (Im(z))? 
is invariant under the action of SI(2, R); thus this group acts on M as a group 
of isometries of this metric (Exercise 4). 

If we let the complex number i which corresponds to (0, 1) in R?, play the 
role of p in the general discussion above (Theorems 9.1 and 9.2), we note the 
following two facts. First, the action is transitive. Given any Zo = u + iv with 
v > 0, then an element of G = SI(2, R) taking i to zp is 


ee 


(gi) = 


giving, in general, 


oz + (uo) 


Ag, z) = = Uz + H; 
Oz + (1/,/n) 
and, when z = i, 
O(g, i) =u + iv. 


Second, the isotropy group of i consists of all g = (4 4)eSi(n, R) such 
that i = (ai + b)/(ci + d). However, this means that ai + b = —c + di or 
a = d and b = —c. Since in addition ad — bc = +1, we have also a? + b? = 


1; hence 
E cos sin 
g= —sin@ cos@ 
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and H = SO(2). It follows from our general theory in Section IV.9, that the 
upper half-plane with this geometry and the 2 x 2 positive definite matrices 
of Example 9.4 are equivalent both as manifolds and as homogeneous spaces 
with Si(2, R)/SO(2). This shows that the identification of a homogeneous 
space with a coset space of a Lie group as a prototype is a deeper and more 
interesting result than it might appear to be. In many cases rather concretely 
given geometric spaces can best be studied in the context of coset spaces of 
Lie groups. 

The example we have been considering, the upper half-plane, is a realiza- 
tion (due to Poincaré) of the space of non-Euclidean geometry discovered by 
Bolyai, Lobachevskii, and Gauss. Its geometry can be studied using results 
of this chapter. For example, we have earlier asked the reader (Exercise 5.4) 
to check that the lines x = constant are geodesics in this geometry. Now we 
propose another problem: Show that the upper halves of circles with centers 
on the x-axis are—when suitably parametrized—also geodesics. This is done 
by showing that each such circle is an image by one of the isometries of G of 
a vertical line. Moreover, since through a point z there is such a circle 
tangent to any direction, these must be all of the geodesics. Using this fact it 
is easy to see that Euclid’s postulate of parallels does not hold in this 
geometry: There are more than one, in fact an infinite number of lines 
through a point z not on the line L which are parallel to L. that is. do not 
intersect L at any point of the upper half-plane M. The possibilities are 
shown in Fig. VII.15, where L} and L, indicate parallel lines (geodesics) 
which bound the infinite collection (faint lines) of lines £ parallel to L 
through z. 


axis ' : 
¥ om Li M L2 


Y 
MMM Mh 
Figure VILIS 
(9.6) Example As a last example of a symmetric space, we mention the 


Grassmann manifold G(k, n) of k-planes through the origin of £". We have 
noted in Section IV.9 that this is a homogeneous manifold acted on in a 
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natural way by Gi(n, R). It is easy to see that the subgroup SO(n, R) also acts 
transitively on the k-planes in R”. In fact, a k-plane contains an orthonormal 
basis f,,...,f, which can be completed to an orthonormal, oriented basis 
f\,...,f, of R". Then there exists an orthogonal transformation of determin- 
ant +1 taking the standard basis e}, ..., e, to this one. Hence the k-plane Po 
spanned by e;, ..., €, is carried onto any k-plane P by at least one element of 
SO(n, R) acting in the natural way. The isotropy group H of Pp is S(O(k) x 
O(n — k)), the matrices in SO(n) of the form 


A 0 
l p) AeO(k), BeO(n—k), det Adet B= +1. 

We shall not pursue this example further except to mention that in this 
case « is the automorphism «: xt gxg~' determined by the element 


D -I, 0 
os 0 l-k 
of Gi(n, R); then a(x) = x if and only if xe H. 


Many further details on this and other symmetric spaces may be found in 
Helgason [1]. 


Exercises 


1. Prove that any positive definite, symmetric n x n matrix P is of the form 
P = AA’, where A is a nonsingular matrix of determinant + 1. Find all 
possible 4 such that P = AA’ for a given P. Show that conversely 44’, A 
ann X n matrix, is positive definite if A is nonsingular. 

2. Show that if P is a positive definite symmetric matrix and P = P~ +, then 
P=]. 

3. Ifa, b,c, d are real, compute the imaginary part of (az + b)/(cz + d)and 
show that it has the same sign as Im(z). 

4. Prove that SI(2, R), acting as in Example 9.5, leaves the Riemannian 
metric given there invariant, that is, each transformation of this type is 
an isometry of the upper half-plane. Show that under these isometries 
circles with center on the x-axis go into circles of the same type, or 
vertical lines. 

5. Show that the mapping A —> e^ maps the symmetric n x n matrices onto 
the positive definite symmetric matrices and is one-to-one and onto. Use 
this to show that curves through J given by P(t) = e'4, A symmetric, 
n x n, allow us to identify T,(M), the tangent space at I to the manifold 
M of positive definite symmetric matrices of determinant +1 with the 
symmetric n x n matrices of trace zero. (See Example 9.4.) 
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6. Show that (A, B) = tr(AB) defines an inner product on the space of 
symmetric n x n matrices. Using Exercise 5, show how this may be used 
to define a Riemannian metric on M which is invariant under the action 
0: Sl(n, R) x M —> M of Example 9.4. 

7. Verify that « defined in Example 9.6 is an involutive automorphism of 
SO(n) leaving fixed the subgroup H. 

8. Show that in Example 9.5 there is a unique geodesic segment joining any 
pair of points p, q and that its length is the distance d(p, q) as defined in 
Section V.3. 

9, Show that the mapping A —> e4 of M(R) > SI(2, R) is not onto. 


Notes 


The proper generalization of differentiation from Euclidean manifolds to Riemannian 
manifolds was difficult to discover and came long after the work of Gauss and Riemann. The 
notion of parallel displacement of vector fields along curves generally attributed to Levi- 
Civita [1] furnished the basic idea from which the theory developed. The use of the operator Vy 
and its axiomatization are much more recent and are due to Koszul {1]. Many of the references 
contain a more complete theory of connections and of differentiation on manifolds which does 
not depend on a Riemannian metric—and hence is not unique. 

The curvature, which is introduced so briefly here, is in some sense the obstacle to differen- 
tiating exactly as in Euclidean space, for there parallel displacement of a vector field along a 
curve from p to q is independent of the path chosen. In the Riemannian case, however, it is not 
the same along every curve. Thus parallel displacement of T,(M) along a closed curve (loop) at 
p yields the identity transformation of T,(M) if we are in Euclidean space, and a linear transfor- 
mation related to the curvature operator otherwise. 

Once differentiation is successfully generalized, one can begin the study of Riemannian 
geometry itself. We began this with the study of geodesics and, in the next chapter we shall go 
on to a brief study of curvature. Of course, we do not dig very deeply; both geodesics and 
curvature as well as their interrelations are the basis for an enormous amount of interesting 
research. Indeed, geodesics and curvature are the basic objects of study in Riemannian 
Geometry. The reader should consult Bishop and Crittenden [1], Kobayashi and Nomizu [1], 
Milnor [1], and other books listed in the references for further work and extensive bibliographies 
of these topics. 

Symmetric spaces are among the most important examples of Riemannian manifolds. In 
particular, they include as special cases all of the classical geometries: Euclidean, non-Euclidean, 
projective (real and complex); they were first studied by Elie Cartan in several important papers 
and were completely classified by him. They are all homogeneous spaces of Lie groups and 
indeed their classification is closely related to the classification of real, semisimple Lie algebras, 
also carried out by Cartan. The reader should consult Chern and Chevalley [1] and Helgason 
[1] for more details. Some further examples are discussed in Chapter VIII. 
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This chapter continues our brief introduction to Riemannian geometry by defining and 
interpreting the concept of curvature. This is the most important invariant of a Riemannian 
metric on a manifold and completely determines the local geometry. Its definition requires the 
operation of differentiation developed in Chapter VII. 

We begin with a brief exposition of the geometry of surfaces in E°, that is, two-dimensional 
submanifolds imbedded in ordinary Euclidean space; E? comes equipped with a Riemannian 
metric and thus induces one on the surface. Using differentiation of vector fields along curves on 
the surface M we are able to define a symmetric bilinear form (covariant tensor of order 2) on M 
which is related to the shape of the surface, and a corresponding symmetric (self-adjoint) 
operator on the tangent spaces to M whose trace and determinant are the mean and Gaussian 
curvatures. The latter, denoted by K, is of fundamental importance because of the profound 
discovery of Gauss that it is unaltered by modifications of the manner in which M is imbedded 
so long as lengths of curves (and hence the Riemannian metric) are unaltered. This is not proved 
until Section 4; but in Section 3 we deduce the basic symmetry properties of the Riemannian 
curvature R(X, Y, Z, W), a covariant tensor of order 4 which was defined in Section VII.4. 
These properties are then used in Section 4 to prove Gauss’s theorem and to determine the 
relation of the Gaussian curvature and the Riemannian curvature. This involves the important 
idea of sectional curvature in an arbitrary Riemannian manifold, which is defined and discussed 
in Section 4, using the various symmetries of R(X, Y, Z, W). 

In Section 5 we extend the differentiation process previously defined for vector fields to 
arbitrary covariant tensor fields and use it to define the notion of parallel vector fields. We state 
(without proof) the local characterization of symmetric spaces as Riemannian manifolds whose 
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curvature tensor is parallel. This includes, in particular, the manifolds whose curvature is 
constant on all sections, called manifolds of constant curvature. The last section is devoted toa 
discussion of these Riemannian manifolds; they include all of the classical geometries: 
Euclidean, hyperbolic or non-Euclidean, and elliptic (real projective space). 


1 The Geometry of Surfaces in E’ 


In this section we use our earlier definitions of curvature for curves in 
Euclidean three-dimensional space to obtain various quantities which mea- 
sure the shape of a surface M near each of its points. All of these will depend 
for their definition on the fact that the surface M lies in Euclidean space; 
however they will be independent of the coordinates used both on M and on 
E’, as will be seen from their definition. Since all properties are local in 
character, we suppose that M is an imbedded surface of which we consider 
only a portion covered by a single coordinate neighborhood U, œ with 
W = ọ(U) a connected open subset of R?, the uv-plane. Thus pe U = M 
has coordinates (u(p), v(p)) = o(p); and, taking the Euclidean three- 
dimensional space with a fixed Cartesian coordinate system, that is, identify- 
ing E’ with R°, the imbedding or parameter mapping 9 ': W > U c R? is 
given by x' = f‘(u, v), i = 1, 2,3. Let E, = o,'(6/du) and E, = ø% '(6/dv) 
denote the coordinate frames and suppose further that M is orientable and 
oriented with U, ọ giving the orientation. This is an important condition on 
M since we are then able to define, without ambiguity, the unit normal vector 
field N to M; it is the unique unit vector at each pe M which is orthogonal 
to T,(M) = T,(R°) and so chosen that E,, E,, N form a frame at p with the 
same orientation as 6/0x', 6/dx”, 6/dx* the standard orthonormal frame of 
R?. Length and orthogonality are defined in terms of the inner product 
(X, Y) of Euclidean space which, of course, induces a Riemannian metric on 
M by restriction. We shall study the shape of M at pe M by means of the 
derivative of N in various directions tangent to M at p. 

In fact, using the ideas developed in Sections VII.1 and VII.2, let p(t) be 
any differentiable curve on M with p(0)= p and p(0) = X,¢€ T,(M). Re- 
stricting N to p(t) gives a vector field N(t) = N „n along p(t) which may be 
differentiated in R? as a vector field along a space curve, giving a derivative 
dN/dt which is itself a vector field along p(t). Applying (VII.1.4c) and using 
(N, N) = 1, we have 

d dN 
0= 5 (NN) = (5 N). 
This means that dN /dt is orthogonal to N(t) at each point p(t) and hence is 
tangent to M, that is, dN/dt € T,» (M) (see Fig. VIII.1). 

If we restrict our attention to a fixed point pe M, and consider various 
curves through it with p(0) = p and tangent vector X, = p(0), then we have 
the following result. 
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Figure VIII.1 


(1.1) Theorem The vector (dN/dt),- 9 depends only on X, and not on the 
curve p(t) chosen. Let S(X,) = —(dN/dt),-9. Then X, > S(X,) is a linear 
map of T/M) > T,(M). 

Proof Let X, = aE, + bE, be an arbitrary element of T,(M) written 
as a linear combination of the coordinate frame E,,, E, of the coordinate 
neighborhood U, » containing p. Let 


= (f (u(t), v(t), f (ult), v(t), f (ult), v(t))) 
be any differentiable curve with p(0) = p, p(0) = X, and suppose p(0) has 
coordinates ug = u(0) and to = v(0). Since p(0) = sP , we have p(0) = 
aE,, + bE>,, that is, u(0) = a and i(0) = b. We denote ‘by ni(u, v) the com- 
ponents of N on U relative to the standard frames in R? 


ô 
+ n?(u, o) + w(u, EN 


N =n! 
n' (u, v ax3 


fa) 
FE 


Then, along the curve 


and 


2) - £2 oo (2) old 
AOLA 


This shows that S(X,) depends linearly on the components of X,,, and since 
(dN/dt),—o lies in T,(M), we have S: T,(M)— T,(M) is a linear map. More- 
over only the values (u( 0), v(0)), the coordinates of p, and u(0), 0(0), the 
components of p(0) = X,, appear in the formula. Thus (dN/dt)) depends on 
pand X, and not on a curve used in the calculation. i 
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(1.2) Remark The linear map S: T,(M) —> T,(M) given at each pe M does 
not depend on the choice of coordinate system U, œ on M nor on the 
Cartesian coordinate system used in Euclidean space. This is because N is 
defined using only the orientations of M and Euclidean space and the inner 
product of the Euclidean space; the differentiation depends only on the 
existence of parallel orthonormal frames in Euclidean space. Thus N, dN/dt, 
and S are independent of coordinates and involve only the geometry of M as 
an imbedded surface in Euclidean space. The operator S has been appro- 
priately called the shape operator, see O'Neill [1], in whose work the reader 
may find a detailed discussion. 


By way of examples, suppose M is the xy-plane. Then N = E}, a con- 
stant vector, so that S(X,) = 0. On the other hand if M is a sphere of radius 
R, the unit normal N at (x', x*, x*)e M is given by 

x! ô r x? ô m x? ô 
~ Rôx! R dx?” R 6x? 


N 


If we move in any direction tangent to the sphere along a great circle curve, 
parametrized by arclength so that ||X,||=1, then S(X,)= —dN/ds 
= —(1/R)X,. Further examples will be considered presently. 

We may use the linear map S: T,(M)—> T,(M)—more accurately 
denoted S,—which we have determined at each pe M to define a C” covar- 
iant tensor field on M, assuming (as we shall henceforth) that M is a C® 
submanifold. We follow a standard procedure from linear algebra: Let 
S:V— V be a linear operator on a vector space V with inner product 
(X, Y). Then the formula 


Y(X, Y) = (S(X), Y) 
defines a bilinear form, or covariant tensor of order 2, on V. The form ¥ is 
symmetric if and only if 

(S(X), Y) = (X, S(Y)) 
holds for all X, Ye V; S is then called symmetric or self-adjoint. For the 


linear algebra involved the reader is referred to Exercise 9 and to Hoffman 
and Kunze [1]. 


(1.3) Theorem S(X) is a symmetric operator on the tangent space T,(M) for 
each pe M and (X,Y) is a symmetric covariant tensor of order 2. The 
components of S and ¥ are C” if M is a C” submanifold. 


Proof In order to prove these statements we compute the components 
of (X,Y). As above U,ọ is a coordinate neighborhood and 
o`t: W > U c M is the corresponding parametrization. The components 
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of Y(X, Y) relative to the coordinate frames E, = yx '(6/6u) and E, = 
ex ‘(@/dv) are given by the standard formulas below, in which we use 0N/du 
and dN/dv to denote the derivatives of N along the coordinate curves on M 
obtained by holding one corrdinate fixed and allowing the other to vary (as 
parameter along the curve): 


ôN 
Y(E,, E1) = (S(E,), E1) = [a E,), 


ON 
F(E, E2) a (S(E,), E,) = H Es), 


ON 


F(E, E,) = (S(E2), E,) E a E,), 


ON 
YE, E,) = (S(E2), E2) = = Bs) 


If we denote by X = X(u, N the ae vector from 0 to ` '(u, v) 


X = f(u, Di pth" D z + hu, D, 
then X, = E, and X, = E, are just the vectors whose components are the 
corresponding derivatives of the components of X with respect to u and v, 
that is, X, =ôX/ðu = E, and X, = ôX/ðv = E,. Remembering that 
(N, X,) = 0 = (N, X,) and differentiating, we obtain 


ON afi 
-(5 > (N X ) Lng 
ôN oo Of _ (aN 
=A = (N, X ou) = mig = N Xw) = -(F X); 
ôN E. 
-maag 


These computations show that the components of ¥, and hence of S, are C” 
if M is. The second of these relations shows that P(X, Y) = ¥(Y, X) so the 
tensor ¥ is symmetric. The 2 x 2 matrix (I,;) = (Y(E;, E ;)) of its compo- 
nents will often be written 


where | = (N, X u) = hi, m = (N, Xw) = h2 = lhi, and 
n= (N, X w) T l2 * 
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The bilinear form Y(X, Y) is called the second fundamental form of the 
surface M, and the inner product (X, Y) the first fundamental form. Although 
the components of the Riemannian metric (X, Y) are denoted by g;; for the 
general Riemannian case, one often uses E, F, G in the classical case of a 
surface M in Euclidean space. Thus 


gi = E = (Xa, Xa) 
Ji2 = F = (Xa, X.) = (Xa X,) =F= 921» 
g22 = G = (X,, Xo). i 


(1.4) Remark It is a classical theorem of differential geometry (which we 
shall not prove) that two surfaces M, and M3 in R? are congruent if and only 
if they correspond in such fashion that at corresponding points both fun- 
damental forms agree (O’Neill [1, p. 297; Stoker [1, p. 138]). Of course the 
“only if” part is immediate from the definitions. This fact shows the impor- 
tance of these two forms in the geometry of the surface. 


The Principal Curvatures at a Point of a Surface 


Having once proved that S(X) is a self-adjoint linear operator on T,(M) 
at each pe M, we can use standard theorems of linear algebra, together with 
what we have learned of curves in space, to study the geometry of M. 


(1.5) Theorem At each pe M the characteristic values of the linear trans- 
formation S are real numbers k, and kz, k; > kz. Ifk, # kz, then the charac- 
teristic vectors belonging to them are orthogonal; if k, = kı = k at p, then 
S(X,) = kX, for every vector X, in T,(M). The numbers k, and k, are the 
maximum and minimum values of Y(X p, Xp) = (S(X p), Xp) over all unit vec- 
tors X, € T,(M). 


Proof These statements are taken directly from theorems of linear alge- 
bra, but we shall sketch a proof for the case k, # kz, leaving the case 
kı = k, to Exercises 7 and 8. All vectors are elements of T,(M), p fixed, in the 
following proof. We suppose k, > k; are the characteristic values, which are 
real since S is self-adjoint (Exercise 7), and we let F,, F, be characteristic 
vectors of unit length corresponding to k,, k2. We have 


ki (Fi, F3) = (S(F 1), F,) = (Fi, S(F2)) = k(Fj, F3), 
which implies (F;, F2) = 0 when k, # k2, as assumed. Replacing F3 by 
— F, if necessary, we may suppose F,, F, is an orthonormal basis with the 
same orientation as T,(M). 


Next we show that k, and k, are the maximum and minimum values of 
(S(X,), Xp) for unit vectors X,. Any unit vector X „€ T,(M) may be written 
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X, = cos 0 F, + sin 0 F,. Let k(8) denote (S(X,), X,) = ‘¥(X,, X,). Since 
F, F, is an oriented, orthonormal! frame, we have 


(*) k(0) = k, cos? 0 + k, sin? @ — (Euler’s formula). 
Differentiating gives 

dk ; 

aa 2(k, — ky) sin 8 cos 0. 


Hence the extrema of k(@) occur when 8 = 0, 4n, n, or 3x; in other words, 
when X,= +F, or +F, so that k, and k, are maximum and minimum 
values of (S(X,), X,) as claimed. I 


We remark that k, and k, are the maximum and minimum of the expres- 
sion Y(X,, X,)/(X,, Xp) over all X, #0 in T,(M). The points p at which 
kı = k, are called umbilical points of M if k, #0 and planar points other- 
wise. Note that a sphere of radius R consists entirely of umbilical points with 
kı = 1/R=k,. Similarly, if M is a plane, every point is planar with 
k =0=k. 

We shall now interpret k(0) = Y(X ,, X,) geometrically. Let p be a point 
of M and X, a unit tangent vector at p; X, and N, determine a plane on 
which we may take p as origin and X,, N, as unit vectors along the axes (in 
that order), thus giving a coordinate system and orientation on the plane 
(see Fig. VIII.2). The plane intersects M along a curve which, of course, lies 


Figure VIII.2 


on M and on the plane, and passes through p. It is called the normal section 
at p determined by X,; there is clearly such a curve for each X p: Lhe vector 
N, is the normal to the curve at p and X, is its unit tangent vector. Writing 
this curve as p(t) with p(0) = p and with arclength as parameter, we have 
p(t) = dp/dt a unit vector for every t so that p(0) = X,. Differentiating 
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(N, dp/dt)=0 along the curve, we find that (dN/dt, dp/dt) = 
— (N, d?p/dt?) = —k, the curvature of the plane curve p(t) as defined at 
the end of Section VII.1. In particular, at p= p(0), (dN/dt, X,)= 
—(S(X, ), Xp). Thus with X, = cos 6 F, +sin 0 F, as above, we find that 
k(0) = k is the curvature of the normal section determined by X,. For this 
reason k() is called the normal curvature (of the section determined by 
X,); and k, and k, the maximum and minimum of k(0), are called principal 
curvatures at p and the corresponding unit vectors F ,,. F 2, (chosen to con- 
form to the orientation) are called principal directions at p. 

To study the surface at p we will now choose an xyz-coordinate system in 
Euclidean space so that the origin is at p, T,(M) is the xy-plane, and the 
principal directions F,,, F2p and unit normal N, at p are 0/0x, o/éy, 0/éz, 
unit vectors on the x-, y-, z-axes, respectively. Let x =u, y = v, and 
z = f (u, v) be the (parametric) equation of the surface. Then we may identify 
the xy- and uv-planes and assume that the parameter mapping g ° takes 
some open set W on the xy-plane onto an open set U on M. The conditions 


then imply 
f (0,0) =0 and =f, (0,0) = 0 = f,(0, 0). 


If we compute the components of the first fundamental form at p, we 
obtain E = 1 = Gand F = 0. For the second fundamental form, recall that 
ari f ý ; 
o7: (x, y) > (x, y, f (x, y)) is the parametric representation of M and thus 
at p, 1 = (/ðz, fsx 6/02) = fsx, m = (6/02, fry 0/02) = fry, and 


n = (8/62, fry 6/62) = fry - 


Now the fact that we have chosen coordinate axes so that ô/ôx and 6/dy 
are principal directions tells us that m = 0 and! = k,,n = k,. Thus we have 


k(O) = fex cos? 0 +f, sin? O at x=0, y=0. 
Let f (x, y) be expanded in Taylor series at (0, 0). Then 
z = f (x, y) = fex(0, 0)x? + f,(0, O)y? + Ro, 


where R, contains terms of higher order. Let f,,(0, 0) = a and f,,(0, 0) = b. 
Then we see that the normal sections of z = ax? + by? have the same sec- 
tional curvatures at p as does the given surface. Therefore the quadric sur- 
faces must give typical examples. 


(1.6) Example z= ax? + by’, ab > 0 (see Fig. VIIL3a). This is an elliptic 
paraboloid; the principal curvatures are a and b. If both are positive, it lies 
above the xy-plane; if both are negative, it lies below. In either case when k, 
and k, have the same sign, the surface is (locally) on one side of T,(M). 
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ab >O(a>0, 6>0) ab=0(a>0, 6=0) 
(a) (b) 


x 


ab < O(a >0, b<0) 
(c) 


Figure VIII.3 


(1.7) Example z= ax? + by’, ab = 0 (see Fig. VIII.3b). If both are zero, 
we have the xy-plane as our surface; if one, say b = 0, then we have a 
parabolic cylinder which is above the xy-plane if a > 0. 


(1.8) Example z= ax’ + by’, ab <0 (see Fig. VIII.3c). In this case we 
have a hyperbolic paraboloid or saddle surface with the xy-plane tangent at 
the saddle. Suppose, for example, a = 1 and b = —1. Then by Euler's for- 
mula (x) k(@) = cos? @ — sin? 8 and hence k(@) varies from + 1 to — 1 and is 
zero at +7/4, +32/4. When k; > 0 and k, < 0, then the surface must have 
points (locally) on both sides of T,(M). 


In these exercises we follow the notation of the text. 


Exercises 


1. Show that if pe M is not an umbilical or planar point, then there exist 
coordinates U, œ on a neighborhood of p such that the curves 
u = constant and v = constant are tangent at each point to the princi- 
pal directions. [These curves are called lines of curvatures.] 

2. Let M be the surface obtained by revolving a curve z = f(x) around the 
z-axis. Show that the lines of curvature (Exercise 1) are the circles 
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z = constant on M and the curves obtained by intersection of M with 
planes containing the z-axis. Determine the umbilical points. 

3. Let U, ọ be coordinates such that u = constant and v = constant are 
lines of curvature. Show that in the first fundamental form the compo- 
nent F is zero in these coordinates and that the principal curvatures are 
/E and n/G. 

4. A direction X, at pe M such that P(X,, X,) = (S(X,), Xp) = 0 is 
called an asymptotic direction. Show by example that there may be two, 
one, or no asymptotic directions at a point of M. Find the asymptotic 
directions at each point of a hyperboloid of one sheet, 
(x?/a”) + (y?/b?) — (2*/c”) = 1. 

5. Show that if there are two distinct asymptotic directions at p € M, then 
there exist coordinates U, œ around p such that u = constant and 
v = constant are everywhere tangent to asymptotic directions (they are 
asymptotic lines). Find the asymptotic lines for a hyperboloid of one 
sheet. 

6. For a surface of the form z = f(x, y) find the components of the first 
and second fundamental forms and the directions of the lines of curva- 
ture and asymptotic lines. 


In the following exercises assume V is a Euclidean vector space with 
inner product (v, w) and that S: V > V is a linear operator. 


7. Suppose that S is self-adjoint. Show that its matrix relative to an 
orthonormal basis is symmetric. Ifdim V = 2, use this to show that the 
discriminant of its characteristic polynomial is not negative, so that the 
characteristic roots are real. 

8. When S is self-adjoint and dim V = 2, show that if u # 0 is a character- 
istic vector of S, then so is any v orthogonal to u. Prove, using 
Exercise 7, that if the discriminant is zero, then S is a scalar multiple of 
the identity transformation. 

9. Show that the correspondence $< (S(v),w) is an isomorphism 
between the space of linear operators on V and the space of bilinear 
forms on V. 

10. Give a precise definition of what would be meant by a C” field of linear 
operators on a manifold M. If M is Riemannian, show that the collec- 
tion of such fields is isomorphic in a natural way to 7 ?(M). 


2 The Gaussian and Mean Curvatures of a Surface 


The negative of the trace and determinant of any matrix of the linear 
transformation S defined in Section 1 are the coefficients of the characteristic 
polynomial of S and are important invariants. The determinant is 
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K =k, k, the product of the characteristic values; it is called the Gaussian 
curvature of the surface. The trace is k, + k,, the sum of the characteristic 
values; and H = 4(k; + k,) is called the mean curvature of the surface. These 
quantities may be computed directly from the components of the fundamen- 
tal forms, using any parametrization of the surface. This we now proceed to 
do. 


(2.1) Theorem 


_ hn- nd H 1 GI —2Fm + En 
“EG-F f =o EG-F 


K 


Proof Together, 
S(X,,) = aX, + bX,, S(X,) = cX, + dX, 
give the components of the operator S in terms of the coordinate frames 


E, = X, and E, = X,, naturally given by the parametrization of M near p, 
that is, on the coordinate neighborhood U, œ. Thus we may write 


a b 


d and 2H =a +d. 


K = 


In terms of X,,. X, we have 
K (X, x X.) = S(X,) x S(X,) 
and 
2H(X, x X,) = S(X,) x X, + X, x S(X,), 


where x denotes the cross product of vectors in three-dimensional 
Euclidean space. 

Now note that (X, x X,, X, x X,) = |X, x X,|/? = EG — F? and use 
the fact that for any vectors X, Y, U, V of R? we have the Lagrange identities 


(X,U) (xX, V) 


((X x Y} (U x V))= he U) (¥,V) 


Then we obtain the formulas for K and H by taking the scalar product on 
both sides with X, x X, in each of the equations above. I 


Since the Gaussian curvature K is the product of the principal curvatures 
k, and k,, we see that K > 0 at p if both k, and k, are different from zero 
and have the same sign. This means that either k, > 0 and k, > 0 and the 
curve of each normal section curves toward the normal so that the surface 
lies entirely on the same side of the tangent plane as the normal N, 
sufficiently near the point p, or k < Oand k, < Oand each curve goes away 


PDF pdfFactory Pro www.fineprint.cn 


376 Vill CURVATURE 


from the normal so that the surface (near p) lies entirely on the opposite side 
to N,. Equivalently, introducing local coordinates in R? as in Examples 
1.6-1.8, K > 0 if and only if the function z = f{x, y) has a strict relative 
extremum at the point. 

On the other hand, if K < 0, then k, and k, are different from zero and 
have opposite signs. This means that the surface is like a saddle surface: 
some normal sections are concave toward the normal N and some concave 
away from it. 

When k = 0 one of the principal curvatures must be zero and then little 
can be said. Two examples, in addition to the plane, are z = (x? + y’)’, 
which is obtained by revolving z=x* around the z-axis, and 
z = x(x? — 3y°), the so-called monkey saddle, which is similar to the usual 
saddle surface except that there are three valleys running down from the 
pass: two for the monkey’s legs and one for its tail (Fig. VIII.4). 

The mean curvature will be of less concern to us than the Gaussian 
curvature for reasons that will appear later. Surfaces for which the mean 
curvature. vanishes are of special interest, however. They are minimal sur- 
faces; they are like the surfaces formed by a soap film stretched over a wire 
frame (Fig. VIII.5). They have the defining property of being surfaces of 


Figure VIII.4 Figure VIII.5 


Monkey saddle. Minimal surface. 


minimal area among all surfaces with a given boundary (the wire frame). 
Thus, in a sense, they generalize the geodesics—curves of minimal length 
joining two fixed points. Like the equation of geodesics, the vanishing of the 
mean curvature guarantees the property of minimality only in a local sense. 


(2.2) Example We consider a torus; then intuitively we can see that the 


two circles running around the torus which are the points of contact with the 
two parallel tangent planes orthogonal to its axis divide the torus into an 
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inner portion on which K < 0 and an outer portion at which K > 0. Along 
the two circles K = 0, since along these circles the normal vector remains 
parallel to the z-axis (Fig. VIII.6). 


Figure VIIL6 


(2.3) Example Let (u,v) — (u,v, uv) parametrize the saddle surface 
z = xy. Then X, = (@/0x') + v(0/dx3) and X, = (2/dx?) + u(@/dx>) from 


which 
E F\ [{14+v? uv 
F G \ w 1+u7} 
Moreover AN = (—v, —u, 1) with A=(1+u?4 07)? and 
Xmu =0= X,,, Xou = 6/6x?. It follows that 


1 

(; iel A 
m n} |1 

fi 0 


From this we obtain 


K= -(;4), ya”. 


The Theorema Egregium of Gauss 


The entire subject of differential geometry was influenced by a very 
profound discovery of Gauss which may be stated as follows: 


(2.4) Theorem (Gauss) Let M, and M, be two surfaces in Euclidean 
space and suppose that F: M, > M, is a diffeomorphism between them which 
is also an isometry. Then the Gaussian curvature K is the same at correspond- 
ing points. 


To see the meaning of this theorem we shall consider some examples. 
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(2.5) Example Let M, be a plane and M, a right circular cylinder of 
radius R in Euclidean space R°. If we roll the cylinder over the plane, we 
obtain a correspondence which does not change the length of curves or the 
angle between intersecting curves, hence it is an isometry. Since K = 0 for 
the plane, according to the theorem the same must be true of the cylinder. 
Note that they do not have the same second fundamental form, that is, l, m, 
and n do not vanish identically for the cylinder. In fact curvatures of the 
normal sections vary from zero to 1/R. This depends on the imbedded shape 
of the surface, but K does not; it depends only on the Riemannian metric 
induced on M. 


(2.6) Example As a second example, let M, be any open subset of the 
sphere of radius R and let M, be a plane. Since K, = 1/R? # Oand K, = 0, 
the theorem implies that there exists no diffeomorphism of M, into M, that 
is an isometry. For example, any plane map of a portion of the globe must 
distort some metric properties (distance or length of curves, angles, areas, 
and so on). [It is interesting to note that Gauss was engaged on a surveying 
commission at the time he discovered his Theorema Egregium (a “most 
excellent theorem”). The reader is referred to the annotated translation, 
Gauss [1], of Gauss’s famous paper for some historical comments.] 

However, there do exist surfaces isometric to, but not congruent to, say, 
the upper hemisphere. Suppose this hemisphere to be made ofa thin sheet of 
brass. It is intuitively clear that we may bend it by squeezing at the edge 
without introducing any creases (see Fig. VIIJI.7). This will give a surface 
isometric to the original since length of curves is unchanged. It follows that 
K is the same at corresponding points; however, the surfaces are not 
congruent: The circular edge has now become an oval. 


Figure VIII.7 


Surface M, isometric to hemisphere M,. 


(2.7) Example Among the more interesting examples of (locally) isomet- 
ric surfaces are the helicoid and the catenoid (Fig. VIII.8). The first surface is 
given parametrically by 


(u, v) > (u cos v, u sin v, v), u>0, ~ <0v< &. 
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Figure VII11.8 
(a) Catenoid; (b) helicoid. 


It is similar in shape to a spiral staircase. On the other hand, the catenoid is 
obtained by revolving the catenary x = cosh z around the z-axis. We may 
parametrize it as 


(z, 8) > (cos 8 cosh z, sin 8 cosh z, z), -m<z<0, 0<0< 2r. 
The isometry between these surfaces is given by 
p= 6, u = sinh z. 


The verification is left as an exercise (Exercise 8). 


We emphasize what by now may be obvious, namely, that Theorem 2.4 
implies that the Gaussian curvature K of a two-dimensional Riemannian 
manifold M is determined by its structure as an abstract Riemannian mani- 
fold, not by its particular embedding into R*. Of course, in our presentation 
of K in the preceding paragraph, the Riemannian metric on the surface is 
given by the imbedding in R?; it is induced by the standard Riemannian 
metric of R?. However, according to Gauss’s theorem, two very different 
(noncongruent) imbeddings of the same surface, say F,: M > M, c R? and 
F: M > M, c R? have the same Gaussian curvature at each point if each 
imbedding induces the same Riemannian structure on M or equivalently, if 
F = F,» F;!:M, > M, is an isometry. This leads to the conclusion that 
were the theorem true, K should be computable (on the coordinate neigh- 
borhood U, @) from the components E, F, G of the first fundamental form 
alone. This is the classical proof. (See, for example, Stoker, p. 139 [1].) The 
present proof takes advantage of the subsequent work of Riemann and uses 
the Riemann curvature tensor, introduced briefly in Section VII.4, together 
with consequences of the fundamental theorem of Riemannian geometry 
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both offspring of the work of Gauss and Riemann. At the same time we shall 
make some first steps toward investing the curvature tensor with geometric 
meaning. 


Proof of Theorem 2.4 We remember that at a point pe M the value of 
the Gaussian curvature K is given by 


o n-m 

— EG — F?? 
where E, F, G and l, m, n are the components of the first and second fun- 
damental forms, respectively, relative to a system of local coordinates u, v in 
a neighborhood U of p. The value of the ratio K is independent of the 


coordinates chosen although E, F, G and |, m, n are not. Let E, = X, and 
E, = X,, where X = X(u, v) gives the surface in R?. Then we have seen that 


ON oN ON oN 
zas 2 = fe ae ee eee ee 
ae (Fr) E) (Gr. e) (Fo) 


and, since E = (F,, E,), F = (E,, E2), and G = (E3, E2), 
EG — F’ = (Ei, E,)(E2 , E2) — (Ei, E3}. 


Since E, F, G are the coefficients of the Riemannian metric, it is enough 
to show that In — m? = K(EG — F?) depends only on the Riemannian 
metric. We shall show that 


In — m = R(E, E,, E, , E,), 
where R(X, Y, Z, W) is the covariant tensor of order 4 defined in 
Section VII.4, in which case K is given by 
(2.8) K = (EG — F?)"'R(E,, E, , Ey, E1) 


R(E;, Ez, E2, E) R 
(Fi, E, )(E2 > E2) = (E,, E,y 


The left side is independent of local coordinates; thus, the right side is also. 
In fact it is easily shown that replacing E,, E, at a point by any pair of 
vectors F,, Fa, spanning the same plane, leaves unchanged the expression 
on the right-hand side of formula (2.8), which we shall prove gives K. This 
expression, defined at each point of an imbedded surface M, is thus indepen- 
dent of local coordinates on M, and moreover it depends only on the 
Riemannian metric. Clearly this is true of the denominator and we recall 
that by definition 


(R(E,, E2) Ez, E,) = (Ve, Ve, E2 = Ve Vg, E2 = Vie, £3 E2 , E,), 
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which depends only on the Riemannian metric by the fundamental theorem 
of Riemannian geometry (Theorem VII.3.3). In fact in our present case, since 
E, and E, denote coordinate frames of local coordinates u, v, we know that 
[E,, E,] = 0, and so we must show only that 


In — m = (Ve, Ve, E2 — Ve, Vg, E2, E1) 


in order to prove the theorem. 

We may compute the right-hand side using the definition of V;,Z, 
i = 1,2 (for any tangent vector field Z), given originally in Section VII.2; 
namely, we take 0Z/éu and 0Z/év, project them to the tangent plane at each 
point of the surface, and obtain DZ/ôu = Vg, Z and DZ/év = Vz, Z. If N 
denotes the unit normal, and E, = X, and E, = X,, then this procedure 
gives 


Ve, E2 = Xw — (N, Xu), Ve, E2 = Xw — (N, Xy,)N. 


Differentiating again and projecting onto the tangent plane (by subtracting 
the normal component of the derivative) gives 


Ve.(Ve, E2) = X vw = (N, Aw), =c N, 
Ve (Ve E2) = Xuw = (N, X w)N, = c,N. 


We need not compute the scalars c, and c, multiplying N since (N, E,) = 0 
so that these terms vanish in the final computation, in which we take an 
inner product of each term above with E,. This yields for R(E,, E>, E,, E,) 


(Ve, Ve, E2 = Ve, Vg, E2 ? E,) = (Xans X.) > (N, X oo)(Nu ’ Xa) 
Gg (X ouw» X,,) + (N, X(N, , X,). 


This must be seen to be equal to the earlier evaluation of In — m? above, 
namely, 


In — m? = (Nu, X (No, Xo) = (Nu»X.)(N,» Xu) 


Since X,,,, = Xuw, applying the identities developed in the proof of 
Theorem 1.3, we have (N, X,,) = —(N,, X,) and (N, X,,) = —(N,, X.) 
This completes the proof. 


This proof provides an interpretation of the Riemann curvature tensor 
for a two-dimensionat Riemannian manifold. Indeed, when F,, F, are 
chosen at pe M so that they are mutually perpendicular unit vectors, then 
expression (2.8), which we have found for the Gauss curvature K, becomes 


K = R(F,, F,, F3, F,). 
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Exercises 


1. Prove the Lagrange identity for the inner product of the cross product of 
vectors of R°: 


(X x Y,U x V) = (X, U)(Y, V) — (X, V)(Y, U). 


2. Show that k, and k, are given by H + (H? — KY”. 

3. Show that if K > 0 at p, then there are no asymptotic directions, and if 
K < 0 at p, then there are two asymptotic directions and the principal 
directions bisect the angles made by the asymptotic directions. 

4. Show that a surface M is minimal if and only if there are two asymptotic 
directions at each point and they are mutually orthogonal. 

5. For a surface of revolution formed by revolving z = f(x) around the 
x-axis determine when K > 0, when K = 0, and when K <0. Give a 
sufficient condition that the surface be minimal. 

6. Verify that a diffeomorphism of two Riemannian manifolds which 
preserves lengths of all C’ curves is an isometry, that is, it preserves the 
inner product in the tangent spaces at corresponding points. 

7. Verify that (2.8) is unchanged if E,, E, is replaced at p by F,,, F2p» 
another basis of T,(M). 

8. Verify that Example 2.7 is a local isometry as claimed. 


3 Basic Properties of the Riemann Curvature Tensor 


We have previously (Section VII.4) defined the curvature tensor 
R(X, Y, Z, W) of a Riemannian manifold M. Recall that it is a covariant 
tensor field of order 4 whose value at any point pe M is determined as 
follows: Let X, Y, Z, W be vector fields whose values at p are given, say 
Xp, Yp» Zp, Wp. Then 

R(X,., Y,,2,, W) = (Vx, Vr Z — Vr V xZ — Vix. y2 W,). 
We have shown that this is independent of the vector fields chosen and 
defines a C* covariant tensor field. 

In the same way the vector fields X, Y define at each pe M a linear 
operator, the curvature operator, R(X,, Y,) on T,(M) by the prescription 


R(X,, Y,) Zp = Vx,VyZ — Vy,VxZ — Vix. yZ 


which is—like the curvature tensor—linear in X, Y, Z in the sense of a 
C*”(M) module, that is, if fe C*(M), then 


fR(X, Y): Z = R(X, Y): Z = R(X, fY): Z = R(X, Y) fZ. 
Obviously these objects are related by the equality 
R(X, Y, Z, W) = (R(X, Y): Z, W). 
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As we now prove, the curvature tensor satisfies a number of important 
symmetry relations. 


(3.1) Theorem The following symmetry relations hold for the curvature 
tensor and curvature operator at each point, and hence for all vector fields. 


(i) R(X, Y): Z + R(Y,X)-Z =0, 
(ii) R(X, Y): Z + R(Y, Z): X + R(Z,X)-¥ =0, 

(iii) (R(X, Y): Z, W) + (R(X, Y): W, Z) =0, 

(iv) (R(X, Y): Z, W) = (R(Z, W)- X, Y) 


Proof Relation (i) follows immediately from the formula above which 
defines the operator R(X, Y). The fact that R(X, Y, Z, W) is a tensor, in 
particular, the linearity with respect to C® functions, has the following 
important consequence: It suffices to prove any of these statements for the 
vectors of a field of coordinate frames, say E,,..., E,. However, for these 
vector fields the Lie products [E;, E;] = 0; so if X, Y, Z are chosen from 
among E,,..., E,, then proving (ii) reduces to showing that 


Vx(Vy Z) — Vy(VxZ) + Vy(VzX) — V2(Vy X) + Vz(Vx Y) — Vx(VzY) = 0. 
By definition of Riemannian connection, Vy Y — Vy X = [X, Y] = 0. Using 
this, we find that the terms on the left cancel two by two; this proves (ii). To 
prove (iii) we may show that the equivalent statement, (R(X, Y): Z, Z) =0 
for all X, Y, Z, is true. Again it is enough to do so for X, Y, Z chosen from 


among the vectors of the coordinate frames so that [X, Y] = 0. Applying the 
definitions, we see that 


(R(X, Y): Z, Z) = (Vx(VyZ) — Vy(V_Z), Z) = 0 


if and only if (Vx(Vy Z), Z) is symmetric in X, Y. Now differentiating the 
inner product (Z, Z) with respect to X and Y, we find that 


Y(X(Z, Z)) = 2Y(VxZ, Z) = (Vy(VyZ), Z) + UV yZ, Vy Z), 
from which it follows that 
(Vy(VyZ), Z) = 4YX(Z, Z) — (VyZ, VyZ). 
Since [X, Y] = 0, (XY — YX)f = 0 for any function f, and in particular, 


taking f = (Z, Z), we see that the right side is symmetric in X, Y and so also 
the left. 
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Property (iv) is derived from the first three as follows. We take the inner 
product of (ii) with W and obtain from this three more equations by circular 
permutation of X, Y, Z, W, namely, 


(R(X, Y): Z, W) + (R(Y, Z)- X, W) + (R(Z, X): Y, W) = 0, 

(R(Y, Z)- W, X) + (R(Z, W)- Y, X) + (RW, Y): Z, X) = 0, 

(R(Z, W): X, Y) + (RCW, X)-Z, Y) + (R(X, Z): W, Y) = 0, 

(R(W, X)- Y, Z) + (R(X, Y)- W, Z) + (R(Y, W): X, Z) = 0, 
Adding the first two and subtracting the second two of these and using (i) and 
(iii) yields. 

2(R(X, Y)-Z, W) — 2(R(Z, W): X, Y) = 0, 

which proves (iv). | 


In any coordinate neighborhood U,@ we have coordinate frames 
F,,..., E, and we may introduce (as in Remark VII.4.5) n* functions of the 
coordinates Rx, 1 < i,j,k, 1 < n by the equations 


R(E; , E): E; = 2 Rju E; . 
j 
Similarly we may define the components R;jx of the Riemannian curvature 
tensor by the equations 
Rijki = (R(E, , E,)E;, E;) = Y Ri aGnj > 
h 
where of course g;; = (E;, E;) are the components of the Riemannian metric. 
By linearity both R(X, Y)- Z and (R(X, Y): Z, W) are determined on U by 


these locally defined functions. The preceding theorem may be written in 
terms of components as follows. 


(3.2) Corollary For all 1 < i, j, k,l < n we have 


(i) Rou T Rö =0, 
GD) Rra + Riu + Rii = 0, 
(iii) Rij F R jix =0, 
(iv) Rija = Ruri 


(v) Ripa + Riy + Rij = O- 


We remark that (v) is an immediate consequence of Rij = Ya Ri Gnj 
and (i)-(iv). 

The Riemann curvature tensor (R(X, Y): Z, W) is used to define the 
sectional curvature, which plays an important role in the geometry of 
Riemannian manifolds. At any pe M we denote by z a plane section, that is, 
a two-dimensional subspace of T,(M). Such a section is determined by any 
pair of mutually orthogonal unit vectors X, Y at p. 
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(3.3) Definition The sectional curvature K(x) of the section x with 
orthonormal basis X, Y is defined as 


K(x) = —R(X, Y, X, Y) = —(R(X, Y): X, Y). 

From the symmetry and linearity properties it is easy to see that replac- 
ing X,Y by any pair of vectors X’, Y’, where X = «X’+ BY’ and 
Y = yX' + OY’ gives the relation 

(1/A?)(R(X", Y’): X’, Y’) = (R(X, Y): X, Y), 


where A = «ô — fy, the determinant of coefficients. If X’, Y’ is also an 
orthonormal pair, then A = +1 so that the definition of K(z) is independent 
of the pair used. If it is just any arbitrary linearly independent pair, then 
using A? = (X’, X’)(Y’, Y’) — (X’, Y’)?, we have 


(R(X’Y’) X’, Y’) 
X, XY, Y) - X, YF 


(3.4) K(n) = 


In local coordinates, using (E;, E;) = g;; and the notation above, 


X Rija P'a" B 
2 (GG it = Jug Jippe 


where summation is over i, j, k, l and X’ = }; «'E;, Y' = }; B’E;. 

Although it is not obvious, the symmetry properties of the Riemann 
curvature tensor imply that both (R(X, Y): Z, W) and R(X, Y): Z are com- 
pletely determined for arbitrary X,Y, Z, W if K(x) is known for all 
sections 7. 


K(x) = — 


(3.5) Theorem I[f dim M > 3 and the sectional curvature is known on all 
sections of T,(M), then the Riemann curvature tensor is uniquely determined 
at p. 


Proof Let R(X, Y, Z, W) and R(X, Y, Z, W) be two tensors with the 
symmetry properties of Theorem 3.1 and let A(X, Y, Z, W) be their differ- 
ence. It will also be a tensor with these symmetry properties. Our assump- 
tion is that for all X, Y, R(X, Y, X, Y) = R(X, Y, X, Y), or equivalently, 
A(X, Y, X, Y) = 0. We must show that this implies that A(X, Y, Z, W) = 0 
for all X, Y, Z, W, that is, that A = 0. Let pe M and let X, Y, Z, W, etc., denote 
arbitrary vectors in TM). Then 


0=A(X + Y,Z,X +Y, Z) = A(X, Z, Y, Z) + AY, Z, X, Z), 
so, by (3.1)(iv), 
A(X, Z, Y, Z) = 0. 
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From this, by replacing Z by Z + W, we obtain 
0 = A(X, Z + W, Y,Z + W) = A(X, Z, Y, W) + A(X, W, Y, Z) 
to which we apply (3.1)(iv) and then (i) to obtain 
0 = A(Y, W, X, Z) + A(X, W, Y, Z) = AY, W, X, Z) — A(W, X, Y, Z). 


Renaming the variables so that Y, W, X, Z becomes X, Y, Z, W, this may be 
written as 


A(X, Y, Z, W) = ACY, Z, X, W). 


This says that a cyclic permutation of X, Y, Z does not change the value of A. 
Thus (3.1)(iii) reduces to 


3A(X, Y, Z, W) =0, 


which completes the proof. | 


This theorem does much to establish the importance of the sectional 
curvature in the study of Riemannian geometry. We can also use sectional 
curvature to give a geometric interpretation of curvature in terms of the 
Gaussian curvature K of surfaces. However, to do this we will first need to 
complete our treatment of the equations of structure, which will be done ina 
later section. 

We shall say that a Riemannian manifold M is isotropic at a point pe M 
if the curvature is the same constant K, on every section at p and isotropic if 
it is isotropic at every point. Of course a two-dimensional Riemannian 
manifold is (trivially) isotropic. 


(3.6) Corollary If p is an isotropic point of M and U, ọ is a coordinate 
neighborhood with coordinate frames E,,...,E, and Riemannian metric 
gi; = (Ei, E,), then 


Rijki = —K, (991 = Jag) at p. 


Proof It is easy to check that the right side defines a tensor of order 4 
on T,(M) with the same symmetry properties as R(X, Y, Z, W) and with 
constant value on all sections. The corollary then follows from the unique- 
ness theorem (Theorem 3.5). | 


(3.7) Definition An isotropic Riemannian manifold is called a manifold of 
constant curvature if K, is the same at every point. 


An example is Euclidean space where K, = 0. This concept will be dis- 
cussed more fully in a later section. 
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We saw in Chapter V that there exist algebraic operations on tensors on 
a vector space V which yield new tensors on V. Addition and multiplication 
of tensors as well as the operators of and Z are examples. The systematic 
study of these operations is a branch of linear (or multilinear) algebra. It is 
important to differential geometry because each such operation has an im- 
mediate counterpart in tensor fields on a manifold. This is treated system- 
atically in many of the references, at the very beginning, for example, in the 
books of Sternberg [1] and Kobayashi and Nomizu [1]. We will content 
ourselves with examples showing how the curvature tensor yields other 
related tensors. 

Let R(X, Y, Z, W) denote the curvature tensor on a Riemannian mani- 
fold M. We shall use this curvature tensor to define a (covariant) tensor field 
S(X, Y) of order 2 and a (scalar) function on M. Let pe M and let 
F,,,-.-,F,, be an orthonormal basis at p. Then it is left as an exercise to 
verify that 

n 


S(X».¥)= RUF 


i=1 


wedge pele 


ip? 


(R(F; Xp): Yp» Fip) 


ip > 


iM 


l 


is independent of the choice of orthonormal basis and defines a symmetric, 
C*, covariant tensor field S on M. 


(3.8) Definition The tensor field S(X, Y) is called the Ricci curvature of 
M. If there is a constant c such that 


S(X, Y) = c(X, Y), 


that is, S(X, Y) is a constant multiple of the Riemannian metric on M, then 
M is called an Einstein manifold. The function r on M, defined by 


r(p)= È R(Fip, Fip Fip» Fip) = 2 S(F ips F jp) 
i j=l j=1 

is called the scalar curvature of M. 
Spaces of constant curvature are examples of Einstein manifolds 


(Exercise 6). A further example is given by the corollary to the following 
theorem. 


(3.9) Theorem On a compact Lie group G with a bi-invariant Riemannian 
metric, the sectional curvatures at e (hence everywhere) are given by the 
Jormula 


K(ne) = —R(X., Yo, X., Ye) = +4([X, Y], [X, Y)), 
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where X, Y are an orthonormal pair of left-invariant vector fields spanning the 
section T, at e. The curvature operator is similarly given at e, hence at all 
points by 


R(X, Y): Z = —4[X, Y], Z] 
with X, Y, Z left-invariant vector fields. 
Proof We proved in Theorem VII.8.12, that for left-invariant vector 
fields X, Y, the connection of a bi-invariant metric on G is given by 


Vy Y = 3[X, Y]. Applying first the definition and then the Jacobi identity, 
we obtain 


R(X, Y): Z = Vx(VyZ) — Vy(VxZ) — Vix. yZ 
a[X, [Y, Z)] — a[ Y. [X, z]] — 2[[xX, Y], Z] 
HZ. [x. Y]]= —4[[xX. Y}. Z} 


We also know that for left-invariant vector fields U, V, W on G the follow- 
ing identity holds, according to Lemma VI.8.12, 


(LU, V], W) = (U, [V, W)). 


Thus, if X, Y are left-invariant and are an orthonormal basis at e of x, a 
plane section, the sectional curvature is 


K(x) = —R(X, Y, X, Y) = a([[X, Y], X} Y) = 4([xX, Y]. [X,Y]. 1 


If g is a Lie algebra and X € g, then ad(X) denotes the linear mapping of 
g defined by ad X(Y) = [X, Y]. By Exercise VI.8.8, ad X = 0 if and only if 
X ec, the center of g. We shall say that a compact Lie group G is semisimple 
if the center of its Lie algebra is {0} or, equivalently (Exercise 5), if the center 
of G is discrete. 


(3.10) Corollary Let G be as above and X, Y, Z be left-invariant vector 
fields. Then the Ricci tensor S(X, Y) is given by the formula 
S(X, Y) = —4tr(ad X » ad Y) 


and is positive semi-definite and bi-invariant on G. Each compact semisimple G 
is an Einstein manifold relative to any bi-invariant Riemannian metric. 


Proof Using the formula above we see that the linear operator 
Z > R(Z, Y): X on G is defined at e for the left-invariant vector field by 


R(Z, Y): X = —3(ad X)(ad Y)- Z. 


According to Exercise 3, an alternative definition of S(X, Y) is that it is the 
trace of the linear mapping Z > R(Z, X): Y on the tangent space at each 
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point. Then, since S(X, Y) = S(Y, X), the formula of the theorem holds. On 
the other hand, if F,,..., F„ is an orthonormal basis of left-invariant vector 
fields, then the formula used in the proof above 


(ad X- F;, F) = (LX, Fi], F) = — (F, LX, Fy) = —(Fi, ad X- F;) 


shows that the matrix (a,;) of ad X, relative to this basis, is skew symmetric. 
Hence 


trad X ad X = ) ajay = — } aj. 
i,j ij 


Therefore, 4S(X, X) = —trad X ad X =} a?,>0 with equality holding 
only when ad X =0. Hence S(X, Y) is positive semidefinite. Moreover, 
if G is semisimple, it is positive definite. It is clearly left-invariant: when 
X, Y, Z are left-invariant so is R(Z, Y)- X and S(X, Y), its trace. This means 
that S(X, Y) is a bi-invariant Riemannian metric on a semisimple G. How- 
ever two bi-invariant metrics can differ only by a scalar multiple. It follows 
that with a bi-invariant metric, G is Einstein. Note that this corresponds, 
except for a constant factor, to the metric on SO(n) of Example VII.8.6. J 


Exercises 


1. Prove that expression (3.4) depends only on the plane z determined by 
the vectors X’, Y”. 

2. Show that for any orthonormal basis F,,..., F, at pe M, the values of 
S(X,, Yp) = Yie1 R(Fip, Xip, Yp» Fip) is independent of the choice of 
the orthonormal basis and that this formula defines a C®-tensor field 
S(X, Y) as claimed in Definition 3.8. Verify that S(X, Y) = S(Y, X). 

3. Show that S(X,, Y,) is the trace of the linear operator taking Z, € T,(M) 
to R(Z,, Xp); Ye R (M) and use this to show that S(X, Y) is C® for all 
C®-vector fields X, Y. 

4. Show that on a compact Lie group G with a bi-invariant Riemannian 
metric, the curvature is identically zero if and only if G is Abelian. 

5. Show that if g is the Lie algebra of a Lie group G (compact or not), then 
the center of G is discrete if and only if the center of g, 
c= {Xeg|[X, Y] = 0 YY €g}, is {0}. 

6. Show that a Riemannian manifold of constant curvature is Einstein. 


4 The Curvature Forms and the Equations of Structure 


We now return to the viewpoint of Section VII.4. Let U be a neighbor- 
hood on the Riemannian manifold M such that on U is defined a C® family 
of coframes 0', ..., 0” and thus, automatically, a dual C® family of frames 
E,,..., Ep. They may or may not be coordinate frames of a coordinate 
neighborhood U, g. The components of the Riemann metric on U are still 
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denoted by g;; = (E;, E;) however, and according to Theorem VII.4.6, there 
exist uniquely determined one-forms 6 on U satisfying 


(i) do = ya Gi, l<i<n, 
(ii) dg; = Ye Hong + Ve Gin l<iij<n. 


[We remark that by defining 0;, = Èx 64g,;, the equations (ii) assume the 
simpler form dg;; = 0;; + 6;;.] In the special case where the frames are 
orthonormal, that is, g;; = 6;;, we will use w', œj instead of 6° ^”. Then (ii) 
becomes 0 = œ} + œj, 1<ij<n 

The forms 8} determine, and are determined by the Riemannian connec- 
tion. Thus if VE; =},I}E,, then 6) = ),Tj,0" equivalently, 
Vx E; = |}, O(X)E,. The one-forms 6%, 1 < j, k < n, are called the connec- 
tion forms. (A word of caution: rt = = I% only if E,,...,E, satisfy 
[E;, E;] = 0, as is the case for coordinate frames. This symmetry was derived 
from Vg, E; — Vg, E; = [E;, Ej], which we have made part of the definition of 
Riemannian connection; it is equivalent to (i) above.) 

Now suppose that R?,,, 1 < i, j, k, l< n are the components of the 
curvature (as an endomorphism) relative to the given frames, that is, 
R(E,, E,) ` E; =); Rie E;. Then we define n? two-forms Q}, 1 < i,j < nby 


Q= YO Ruat Ht VY Rep ane! 
l<k<I<n 


ktel 


It follows that 


X OVE, , E)E; = } Rju E; = R(E, E); E; 
= j=1 


and by linearity this extends to any vector fields X, Y so that 


R(X, Y): E; = YOUN, Y)E;; 
Pi 
thus (Q}{X, Y)) is the matrix of the curvature operator relative to the basis 
E,,..., E,. Note that the properties of R(X, Y)- Z imply that Q/(X, Y) at p 
depend only on the values of X and Y at p, not on the vector fields; ob- 
viously QUX, Y) = —Qi(Y, X). These n? forms Q} on U, are called the 
curvature forms; they depend on the Riemannian metric and on the particu- 
lar frame-field we use on U. The following result shows the relation between 
these forms and the connection forms. 


(4.1) Theorem Using the notation above, the forms Q/ on U are defined by 
the equations 


(4.2) Qj = doi — Yn, HAO, 1Sij<n. 
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Proof It is sufficient to verify that on any vector fields X, Y on U the 
value of the two-forms on each side of the equation is the same. This is 
equivalent to showing that 


R(X. Y): E, =F (a z ob not) x, vje, ee 
j 
By definition, 
R(X, Y)- E;= Vx(Vy E) — Vy(Vx E;) — Vix, n Ei, 

which may be rewritten 

R(X, Y): Ei = Va( 5940796) = Vi F408.) — E AOX YDE. 

j j j 
Since 6/(Y) and 6/(X) are functions, the right-hand side is equal to 
2 (XOKY)) — Y(@XX)) — OULX, YE; 
+ 2 OTIBO, — 2 OOE E 
is j. 

Applying Lemma V.8.4, this becomes 


5 [aoil Y) = F COA) — PACONE;. 


j 


which shows that, as claimed, 


R(X, Y): E, E= X (ai -— Ea a(x, Y) E 


J 


This completes the proof. i 


(4.3) Remark In summary, we have the following facts. Let U be any 
open subset of a Riemannian manifold M on which is defined a field of 
coframes 0@',..., 0". Let E,,...,£, denote the uniquely determined dual 
frame-field and let g,; = (E,,E,;) on U. Then there exist n? uniquely 
determined one-forms 6/ on U satisfying conditions (i) and (ii) of the first 
paragraph of this section. They determine the two-forms Q}, and hence the 
curvature on U, by (4.2). Equations (i), (ii), and (4.2) are known as the 
equations of structure; they are due to Elie Cartan, who made extensive use 
of them. As noted above, it is often convenient to write 6, = $., 0? gs; so that 
tH) ae a simpler form. We may define, similarly, Q; = °, QF gs; then 

ae = 5 Jy Rin A@' since we have previously seen that 

Rija = }s9js Ra, where Riu = R(E,, E,, Ei, E;) by Definition VII.4.5. 
The symmetry properties of Corollary 3.2 imply that Q; = -Q;;. 
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In the event that the frame-field is orthonormal, that is, consists of vec- 
tors E,,..., E„ with (E;, E;) = 6,;, then as noted above, (i) and (ii) simplify; 
moreover, Q; = O}, Ria = Røa and œf = w,;. Recapitulating the remarks 
above we have the following corollary. 


(4.4) Corollary The forms w',..., œ” dual to a field of orthonormal frames 
determine uniquely a set of one-forms œ, 1 < i, j < n, satisfying 
(i) dai =), 0% a ai 

and 

(ii) wa +o 
and we have 

(iii) da} — Dy, OF Aw = det Rook Aw! = Q} = Qj. 
Relative to these frames the matrix (Q,,(X, Y)) of the curvature operator 


R(X, Y) is a skew-symmetric matrix. 


(4.5) Corollary Let If, denote the coefficients of the connection forms rela- 
tive to coordinate frames E,,..., E,, of a coordinate neighborhood U, œ, that is, 
6 = $, T% 0" with 61,..., 0" being dual to E,,..., Ep. Then T$; = T$; and 


ori, or} 

ax* = Ax! 

Proof According to the theorem 

Qi = dei — $ OFA 0j; 
h 


j= 
Rig = 


= È (Ti Ti, = raTa). 
h 


hence 
Qi = F (dTi a 0! + T} dO!) — Y Y HTO A 0.. 
1 kih 

Now If, = T} since [E;, E;] = 0 for coordinate frames, and it follows that 
d=), # a0; =F, TAG =0 (since 6 6' = —6' 6). Therefore 
the second equation above may be written as 

L 2 ; 1/3r} éYi; 

= j k 1 = bal li = ki 

2 jae ape’ vasa ax! 


kt 


Jenne 


1 ' , 
-5 È È (Tarh — TTL) A 6. 
k,ł h 


Since we have made the coefficients on both left and right skew-symmetric in 
the indices k, l, these equations imply equality of coefficients. We use the 


symmetry of T$, in i, j, the fact that 0* a 6! = —6' ^ 6*, and change of index of 
summation where necessary to obtain the (standard) formula of the corol- 
lary. | 
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Consider the special case in which dim M = 2 and assume, moreover, 
that only orthonormal! frames are used. 


(4.6) Corollary Ifdim M = 2, then dœ? = Q? = — Kw! aw’, where K is 
the Gaussian curvature of M. 


Proof In proving Gauss’s Theorema Egregium we saw that if E,, E, are 
orthonormal unit vectors, then 


K = —R(E,, E,, E,, E,) = —(R(E,, E2) Ey, E,) = —Rj212- 
On the other hand, since g;; = (E;, E;) = 6,; we have 
OF = Q = +Ri21201 Aw”. 


Since wl + œi = 0, of = 0 = œ}. Thus } 2-01 aok = 0 and dof = Q} by 
Corollary 4.4, This completes the proof. | 


Note that these equations are independent of the particular orthonormal] 
frame field on U œ M. We shall now use this to give a geometric interpreta- 
tion of sectional curvature. Let x be a plane section at a point p of M, a 
Riemannian manifold, and let N, be an open, two-dimensional submanifold 
of M consisting of geodesic arcs through p and tangent at p to the section z. 


(4.7) Theorem If we use on N, the Riemannian metric induced by that of M, 
then the sectional curvature K(x) is equal to the Gaussian curvature of N „at p. 


Proof Let U = exp, B, be a normal neighborhood of p, that is, we 
choose £ > 0 such that B, = {X,€ T,(M)| ||X,|| < £} is mapped diffeomor- 
phically onto an open set Uc M. zie plane section x corresponds to a 
two-dimensional subspace V, < T,(M) and we may suppose that N, is the 
image of V, © B,. Since U is a normal neighborhood, it is covered simply by 
the geodesics of length e issuing from p; they are given by exp, txX,, 
0 < t < e, for each X, with |X, = 1. Now choose an orthonormal basis 
Eip,- Enp OL TM `), with E,,, E2p a basis of V,. Then (x',..., x") — 
exp,(>, x'E;,) establishes a system of normal coordinates on U, the coordi- 
nate map p being the inverse of the above. Thus N, is cee by 
x = = x" = 0, and U ^ N,, ọ is a coordinate system 4 on N, with x? , xX? 
as coordinates. Let E,,..., En denote the coordinate frames; : they agree at p 
with the given frame and E,, E, are tangent to N, everywhere on N,. We 
denote the dual coframes by 6’,..., 6", with 6) = $}; r} 0' as connection 
forms. Note that I'f(0) = 0, that i is, @ = 0 at pe U. This was proved in 
Remark VII.6.8. 

From those frames, by the Gram-Schmidt process! we obtain a family of 
orthonormal frames F,,..., F, in U with the property that F,, F, are a 


t See, e.g., Hoffman and Kunze [1]. 
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linear combination of E,. E, and thus tangent to N, at each of its points. We 
denote by w!,...,@" the dual coframes to F,,..., Fẹ, and by coi the corre- 
sponding connection forms; they satisfy the equations œf + w = 0 and 
dœ = F, a" ^ œ}. We shall see that for j > 2, w} = œw} = Oat p. First recall 
that at p, Vx, E: = } j #(X,)E; =0 and Vy, Fi = X @i(X,)F;. Now for 
i= 1,2, F; = alE, + a?E, and so 


Vx,Fi = (X„a!)E, + (X,a?)Ez + a}Vy,E, + @V yx, En. 


Since I% (0) = 0, the last two terms vanish so that for i = 1, 2, Vy, Fi is a 
linear combination of E, and E,, and hence of F, and F,. Thus 
Vx, Fi = oi (X,)Fı + oX JF: for i = 1, 2and w/(X ,) = Ofori = 1, 2 and 
j>2as claimed. 

Now we denote by 1:N,—>M the rere and let @' = I*w’, 
@/ = I*w!. Since I* is a homomorphism of A(M) > /\(N,) and commutes 
with d, we know that dõ' =}, &* A Gj, and @ a, = 0. Moreover, &' = 0 
for i > 2, since F,, F, spans the tangent space to N, ‘and @ (F) = (Po MF) 
= wi(l, F;) = w(F) =0 if j= 1 or j=2 and i>j Thus ai, ð? are 


dual to F,, F, restricted to N „and together with @} = — &; satisfy equations 
(i) and (ii) (of Remark 4.3), which determine the connectar forms uniquely. 
It follows from Corollary 4.6 that d@? = — Kõ' ^ &?. On the other hand. 


we have on M 


k l 
2? =F okno + Y Rimon 
k k<l 


and applying I* to both sides and evaluating at p yields the equality (at p): 
d@? = R212 Ô! AG”. 


It follows that the sectional curvature K(x) = R1212 = K p the Gaussian 
curvature at p of the surface N,. This completes the proof. | 


(4.8) Corollary Let M be an n-sphere of radius a in R"*' with the Rieman- 
nian metric induced from R"*'. Then M has constant sectional curvature 1/a?. 


Proof If p is a point of M, then the geodesics through p tangent to a 
plane x in T,(M) are great circles and form a 2-sphere of radius a. We have 
seen that the Gaussian curvature of such a 2-sphere is 1/a° so the corollary 
follows from the theorem. | 


We have made a distinction between isotropic manifolds and manifolds 
of constant curvature. A theorem of Schur [1] shows that this distinction is 
artificial. 


(4.9) Theorem If M is a connected, isotropic Riemannian manifold and 
dim M > 3, then M has constant curvature. 
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Proof If we let K „be the value of the sectional curvature at p—the same 
on all sections by hypothesis—then we must show that this function on M is 
constant, that is, dK = 0. Let U be a neighborhood of p € M with an ortho- 


normal frame field and let œ+, ..., œ" be the dual coframe field. Using the 
expression for Rim in Corollary 3.6, which now becomes 
Rija = —K(5x5 — Suð), we have Q} = Q; = —Ko' a œ, in which K 


depends only on p, not on the (orthonormal) frames used. Taking the exte- 
rior derivative of the structure equation 


da! = Y af Aw) + Q, 
we obtain 
0 = ¥ (data wi — wf Adal) — dK Aw! aa! — K do na! + Ko ado. 
Substituting for dew‘, dw', and so on, from Corollary 4.4 and simplifying gives 
dK aowa w = 0, 


which holds for every i,j = 1,...,n. Since dK = K,w' + + K, o”, a 
linear combination of œ!, ..., œ", and since œw! Aw! ^a œ # 0 if l, i, j are dis- 
tinct, this can only hold if dK = 0 on U, a neighborhood of p. Because p is 
arbitrary, dK = 0 and K is constant. i 


According to Corollary 4.8, the sphere of radius a with the Riemannian 
metric induced by the Euclidean space with contains it has constant positive 
curvature. Euclidean space itself with its standard Riemannian metric has 
curvature identically zero, since with the usual coordinates T}, = 0 and 
Rija = 0. It remains to give an example of a manifold of constant negative 
curvature of arbitrary dimension. This will be done in Section 6, which is 
devoted to spaces of constant curvature. In the meantime, the reader should 
try Exercise 1. 


Exercises 


1. Show that the two-dimensional manifold M = {(x, y) | y > 0}, the upper 
half-plane, with Riemannian metric in the xy-coordinates which cover 
M given by gı = 1/y? = g22 and gy, = 0 = go, has Gaussian curva- 
ture K = —1. 

2. Let 6',..., 0" be n linearly independent one-forms defined on an open 
subset U < R" and let (g,;(x)) be a symmetric positive definite matrix 
whose entries are C” functions on U. Show by direct computation that 
there exist uniquely determined functions I;,, 1 < i, j,k < n such that 
the n? forms 0; = )7_, T; 0 satisfy the two systems of equations: 


(i) do =), 0 ^0;g", and 
(ii) dg; = Oi + Oji- 
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5 Differentiation of Covariant Tensor Fields 


Until this point we have used covariant differentiation D/dt on Rieman- 
nian manifolds and the associated Riemannian connection V only to differ- 
entiate vector fields—either along curves or in various directions X, at a 
point p of the manifold M. However, once this has been done, it is a rela- 
tively simple matter to extend the procedure to tensor fields; for the basic 
difficulty—lack of a method to compare vectors, tensors, and so on, on 
tangent spaces at nearby points q to a given p € M —has been surmounted. In 
Euclidean space we compare T,(M) and T,(M) by parallel translation; we 
have also used the local one-parameter group of transformations generated 
by a vector field X to define the Lie derivative Ly. Neither procedure is 
available in the general Riemannian case; but there we do have parallel 
transport along a curve from p to q and that is what we now apply. For 
convenience, and since it is all that we need, we restrict our consideration to 
covariant tensor fields. Just as in our earlier treatment of differentiation of 
vector fields on a Riemannian manifold, we first differentiate a covariant 
tensor field along a curve and then, later, determine its derivative in various 
directions X, at a point p of the manifold. 

We consider, then, a covariant tensor field ® of order r on the Rieman- 
nian manifold M, ®e 7'(M), and we suppose given a curve p(t),a<t < b, 
on M of differentiability class C’ at least. Let ®,,,, denote the restriction of ® 
to p(t). Then ®,4)€ 7" (Ty(M)), that is, py is a tensor field along p(t). 
Using Theorem VII.3.12 and Remark VII.3.13, we denote by 1, parallel 
translation along p(t) from a fixed point p(t) of the curve: 


Ti: Treo (M) => Taa (M). 


This is an isomorphism of these tangent spaces and is uniquely determined 
by the curve p(t) and the Riemannian structure. It is exactly what is needed 
to define the derivative of ® at p(to). 


(5.1) Definition With the preceding notation, the derivative D®/dt of the 
tensor ® along the curve is defined at the point p(tọ) by 


DO A 1 

dt J -to (TAP 9) — Pito): 
o >to 

As thus defined (D®/dt), is a covariant tensor of order r on the vector 

space T,1)(M). In fact, given any set of r vectors X jigs -> X peo E Tp (M) 

then D®/dt at p(tọ) is the limit. as t > to of the expression 


Pm tg EP vA X a)» ++ Xoo) = Poo lX pto +++ Koto) 
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which for each value of t near tọ is a multiple [by 1/(t — to)] of the difference 
of two tensors t*®,,,. and ® ro On Tpu (M). Since both are covariant r 
tensors on the same vector space, it follows that the limit is also such a 
tensor. Repeating this procedure at each ty on the interval (a, b) gives a 
covariant tensor field D®/dt along p(t), provided that suitable differentiabil- 
ity conditions are satisfied. We mean by this that for any C* family of vector 
fields Xi = Xi, i= 1,..., r, defined along the C* curve p(t), the value of 


p(t)? 


D@®/dt on them, 


D® iyi. Xt), a<t<b, 

dt 
should be a function of class C*7! (C” when k = œ) of the variable t. In 
particular, this should be true in the most frequent situation: X',..., X" are 
C”-vector fields on M and X},..., X7 are their restrictions to the curve p(t). 
In order to see that this is indeed a consequence of our definition and to 
derive computational formulas, we prove the following lemma. For 
convenience—and since it is the most important case—we suppose ® is C”. 


(5.2) Lemma Let ® be a C*-covariant tensor field of order r on M and let 
p(t), a < t < b, be a curve of class C*, k > 1, on M. If Xj, ..., X1 € Tio (M) 
are vector fields of class C* along the curve, then for each ty on the interval 
(a, b) we have 


DỌ d 
(5.3) Cr). (Xb Xio) = [Mw (Xt,--- o) 
DXi 


- E Ono (Xb (FF | esati: 


Proof Before beginning the proof we note that it will indeed establish 
the fact that D®/dt evaluated on C*-vector fields along the curve is differen- 
tiable of class C*~! at least. If k = œ, as will often be the case, then D®/dt 
will be a C®-tensor field along the curve, that is, its value on C”-vector fields 
will be a C® function of t. Although ® itself has been assumed C®, in fact it is 
sometimes convenient to consider curves of lower differentiability class, 
which in turn lowers the class of D®/dt. The assertion of the lemma is easy to 
prove. By definition we have 


Do) n 1 ! ; 
(7) = tim 1 (ef (Xb, XI) ~ Ppl Xb os X) 
(è) 


to t>to t— 


t 1 r r 
= lim IE (Py (t,(X io) ...3 (Xio) aa ® 100)(Xt0 jasi X%,))- 


{>to t (0 
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Then for each i = 1,..., 7 in turn we subtract and add 


UM (Xi, XZ, -3 Xi, (Xi h ree a0. 6) 5 


to 


Rearranging and collecting terms and using both linearity at p(t) and the 
continuity of the tensor ®, we may rewrite the defining equation 


: 1 
(77). È 0(X! seo lim —— (t,(Xio) = Xi) t AXE h a) 
i t>to 10) 


<: 1 r i 9 
+ Tim (pal X?, seg De stead) 


t>to 


We now use the fact that for any C*-vector field X, along p(t), 


= = X, 

i tM) Me timg, (5*9 ža) _ ef” ) = B -(2%3) l 
to tto pane t— to dt dt Jo 
Therefore passing to the limit in the expression for (D®/dt),, completes the 
proof of formula (5.3) of the lemma. I 


We can verify from the formula itself that (D®/dt), depends R-linearly on 
the values of the vector fields X},..., X7 at p(to) so that the formula does 
define an R-linear function, that is, a covariant tensor of order r on the 
vector space T,,,.(M). This is made even clearer, however, by the following 
corollary—which uses the notation above. 


(5.4) Corollary Let X},...,X0€Tyo(M) be given and suppose that 
X},..., Xy are the uniquely determined parallel vector fields along p(t), 
a < t < b, which take these values at p(to). Then formula (5.3) becomes 


DO d 
(ah (Xi, X) = (F Oni- 9} 


Proof This follows from (5.3) since by definition of Xj we have 
DXi/dt = 0, i = 1,...,r. ] 


tato 


This corollary makes it clear that (D®/dt), depends only on the tensor 
field ® and on the curve p(t), a < t < b. In fact, it is easy to verify from the 
formulas above that it depends on the tangent vector Ypo) = P(to) to the 
curve, but not on the curve itself, more precisely, two curves through p(to) 
with the same Pan vector at that point will define the same element 
(D®/dt),, of 7"(Tyao)(M)). We shall state this in the form of a lemma. 


(5.5) Lemma Let ® be as above and let pe M. We suppose that X',..., X" 


are C®-vector fields on some neighborhood U of p and let X}, ..., X', denote 
their value at p. Let F(t), —£ < t < £, and G(s), —6 < s <6, be two C? curves 
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on M such that F(0) = p = G(0) and F(0) = Y, = G(0) is their common tan- 
gent vector at p. Then 
D® D® 
(77) (X,..., Xt) = (7) (X3, ..., X5), 
o 0 
that is, the two tensors on T,(M) defined by differentiating ® along each of the 


curves is the same. 


Proof Suppose that f is a C® function on U. Then f(F(¢)) is its 
restriction to the curve F(t) and 


d d 
(GEO) = FGM = BS 


Similarly restricting f to G(s), differentiating with respect to s, and evaluating 
at s = 0 gives Y,f. Applying this to the function 


Sla) = DXi o XG). 


we see that in formula (5.3) the first term in case of either curve (and 
derivative of ®) is the same, namely Y,(®(X',..., X”)). On the other hand, 
by our original definition of Vy, X for a vector field X in Chapter VII we 
have Vy, X = (D/dt(X po) = (D/ds(X'y)))o: hence the other terms in 
formula (5.3) agree also, which establishes the lemma. | 


We denote the covariant tensor of order r on T,(M) which we have thus 
defined from differentiation of ® along curves through p with Y, as tangent 
at p by Vy ,®. 


(5.6) Definition The covariant r tensor on T,(M) just defined 
Vy DET "(T,(M)), is called the covariant derivative of ® at p in the direction 


Y,. 


According to the facts in the proof above. the covariant derivative is 
given by the formula 


(5.7) Vy, O(X,..., X) = YOX... X) 
— EDX}... Vyr, X5 XI), 
i=1 


where X',..., X" are vector fields on a neighborhood of p. Only their values 
at p affect the value of Vy, ® on T,(M). 


(5.8) Theorem Given De 7'(M) as above, then we may define on M a 
C®-covariant tensor field Y of order r + 1 by the formula 


W(X4,...,X%; Yp) = (Vy, OXX}, ... XP). 
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Proof In view of all that has been shown above it is only necessary to 
prove two more facts: first that for each p e M, Y, is linear in the last variable 
—with the others fixed. Second, that for any C”-vector fields X',..., X”, Y, 
the formula above defines a C” function of p. 

The first fact is a consequence of the linearity in Y, of each term of (5.7) 


as a real-valued function on T,(M). Thus, if we fix the vector fields X',..., X", 
then the mapping T,(M) — R defined by (5.7) 


Y, > (Vy OX}, ..., X5) 


is linear. On the other hand, it is clear that for C* -vector fields X',..., X”; 
Y the function ¥(X', ..., X"; Y) = (Vy ®)(X,, ..., X,) is C7. I 


It is not difficult to give formulas in terms of the components of ® in local 
coordinates for the components of VY. We shall give the formulas and leave 
their verification as an exercise. We suppose that U, ọ is a local coordinate 
system with local coordinates x',..., x", coordinate frames E,,..., E,, and 
with Vs, E; = }, T$ E,. Let Ọ be as above and let ®,,...;, = O(E;,,..., E;,) 
be its components. 


(5.9) Corollary With the notation above, the components 
y = P(E; -e Ezaa) 
of ¥ on U are given by the formulas 


jise drt 


(5.10) Definition A tensor field Pe 7'(M) is said to be parallel along a 
curve p(t) if D®/dt = 0 along the curve. It is said to be parallel if D@/dt = 0 
along every curve on M. 


We remark that if Vy,® = 0 for every X „€ T,(M) and all p€ M, then it is 
parallel; so in fact if it is parallel along geodesics, for example, then it will be 
parallel. This follows from Lemma 5.5 and the fact that there is a geodesic 
tangent to any given vector X,. 

We also note that if p(t), a < t < b, is a curve of class Ct, say, then ® is 
parallel along p(t) if and only if it satisfies 


d 
£ oxi.. X7)) 20 


for every set X},..., X? of parallel vector fields along the curve p(t). 
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(5.11) Example Let M be a Riemannian manifold of constant curvature 
K. Then, by definition, for any orthonormal pair of vectors X,, Y, the 
sectional curvature R(X,, Y,, Xp, Yp) = —K. Suppose p(t) is any curve 
through p with, say, p(0) = p. Let X pq), Yp be the uniquely determined 
parallel fields such that X, = Xpo and Y, = Ypo). Then Xp. Ypo is 
orthonormal at each p(t) and R(X yu), Ypo» X p> Yi) = —K, a constant 
independent of t. It follows that for any parallel vector fields along p(t) say 
Xi, i= 1,2, 3,4, that 

d 

— R(X}, X?, X?, X$) = 0. 

dt 
Indeed the values of all of the sectional curvatures uniquely determine the 
curvature; thus the curvature is parallel if it is constant on parallel sections 
nm, along any curve p(t). 


We might think that this is the only case in which the curvature tensor 
R(X, Y, Z, W) is parallel, but in fact this is not the case as we shall now see. 


(5.12) Theorem (Cartan) If M is a Riemannian symmetric space, then the 
curvature tensor is parallel. 


Proof We know that any isometry of a Riemannian manifold preserves 
parallelism; it carries parallel vector fields, sections, and so on, along a curve 
to parallel vector fields, sections, and so on, along the image. Moreover 
isometries preserve the curvature, 


RX, : Y, > Z, ’ W,) = Re (X rep > Yew) 2 Z Fp) ’ Weep) 


Finally isometries carry geodesics to geodesics. This is because each of these: 
parallelism, curvature, and geodesics is defined in terms of the Riemannian 
metric. Now to show that the curvature is parallel, it is enough to show that 
it is constant on parallel vector fields along geodesics. However, if p(t) is a 
geodesic, then the vectors X pa, Ype)» Zpay» Wow Of the parallel vector field 
determined by X xo)» Ypo)» --- are given by isometries t, of M according to 
Theorem VII.8.7. Therefore the curvature is constant on parallel fields along 


the geodesic p(t), which proves the result. I 


It is important to realize that this is more general than constant curva- 
ture. We have seen an example of a symmetric space—a compact semisimple 
Lie group G with bi-invariant metric—in which the curvatures on various 
sections 7, at the identity vary between 0 (if there is an Abelian subgroup of 
dimension two) and a positive maximum value (see Theorem 3.9). Thus G is 
not isotropic, hence not of constant curvature in this metric, but it does have 
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parallel curvature. This raises the interesting question of how those Rieman- 
nian manifolds with parallel curvature may be otherwise characterized or 
described. The answer to this is given by the following two theorems which 
will not be proved in this text but are quoted for use in the next section and 
for their general interest. Proofs are given by Wolf [1, pp. 30, 42]. 


(5.13) Theorem (Cartan) Let M be a Riemannian manifold with parallel 
curvature. Then M is locally symmetric, that is, each point pe M has a neigh- 
borhood U such that there is an involutive isometry o,: U —> U with p as its 
only fixed point. 


Of course, a manifold may be locally symmetric without being globally 
symmetric, that is, symmetric in the sense of our original definition of sym- 
metric space. For example, Euclidean space or a sphere-—with its usual 
Riemannian metric—is no longer a symmetric space if a single point is 
removed, since we have seen that a symmetric space is necessarily complete; 
but it is still locally symmetric. Even if completeness is assumed, together 
with parallel curvature, we still cannot be quite sure that the space is 
symmetric—some restrictions on the fundamental group may be involved. 
However, if the Riemannian manifold is complete and has parallel curva- 
ture, then we may be sure that its universal covering (with the naturally 
induced Riemannian metric) is a symmetric space. This is a consequence of 
the theorem which follows. A proof ofa more general version due to Hicks is 
given by Wolf [1]. 


(5.14) Theorem (Cartan-Ambrose) Let M and N be complete, connected 
Riemannian manifolds of the same dimension, each with parallel curvature, and 
suppose further that M is simply connected. If peM and qEN and 
p: T,(M) —> T,(N) is any linear mapping which preserves the inner product 
and the curvature; i.e., for arbitrary X,, Y,, Zp, W,€ T,(M), we have 
(9(X,), o(Y,))a = (X,, Y,)p and 


R,(9(X,), OY p), (Zp) (Wp) = R {Xp Yp» Zp, Wp), 


then there is a unique C® mapping F: M +N which has the properties: 
(i) F(p) = q, (ii) Fa: T,(M) > T,(N) is the same as ọ, and (iti) F is a Rieman- 
nian covering (that is, it is a covering such that F, is an isometry on each 
tangent space—thus a local isometry). 


Exercises 


1. Verify the formula of Corollary 5.9. 
2. Let ®(X, Y) be the Riemannian metric on M. Prove that Vz,® = 0 for 
every vector Z, at every point pe M. 
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3. Let o be a field of linear operators on M, that is, an element of 7 {(M), 
so that at each p, o: T,(M) > T,(M) is a linear mapping. Suppose for 
any X € ¥(M) that o(X) € ¥(M) also. Define a derivative Vy, o for cata 
point p in the direction Y,. Show that it will define a tensor field of type 
T?(M). 

4. Using Exercise 2 and the special expression in local coordinates which 
was given in Corollary 3.6 for Riju on a manifold of constant curvature 
K, show that the curvature tensor is parallel on any such manifold. 

5. Show that if an alternating covariant tensor ® is parallel, then the exte- 
rior differential form to which it corresponds is closed. Is the converse 
true? 

6. Let X bea C®-vector field on a Riemannian manifold and ® be a tensor; 
for definiteness let ® be the Riemannian metric tensor. Make a suitable 
definition of the Lie derivative L,® and interpret Ly ® = 0. 


6 Manifolds of Constant Curvature 


The manifolds of constant curvature, which we have introduced briefly in 
previous sections, are on the one hand the simplest Riemannian manifolds 
and yet on the other hand are sufficiently complicated to present a fascinat- 
ing object of study. Classically they are the oldest known examples in the 
sense that they include Euclidean space, the non-Euclidean spaces dis- 
covered by Bolyai and Lobachevskii, and the spherical and elliptic spaces, 
whose geometry was studied by Riemann (as examples of spaces in which no 
parallel geodesics existed). In fact locally, the geometry of any space of 
constant curvature is equivalent to one of these classical geometries as we 
shall see. What makes these spaces particularly intriguing is that questions 
about them can often be reduced to purely algebraic problems of an inter- 
esting nature. In the short scope of this paragraph we can only give an 
indication of the special flavor of this subject, but the books of Kobayashi 
and Nomizu [1] and Wolf[1] contain many details for the reader who 
wishes to go further. 

We recall that a Riemannian manifold M is said to have constant curva- 
ture if all sectional curvatures at all points have the same constant value K. 
This implies that the curvature tensor is parallel and hence that the manifold 
is locally symmetric according to Theorem 5.13, but the converse does not 
hold since there are many symmetric spaces, for example any non-Abelian 
compact Lie group with the bi-invariant metric, which are not spaces of 
constant curvature. According to Corollary 3.6 and to Theorem 4.9, or at 
least to its proof, it should be possible to give a characterization of Rieman- 
nian manifolds of constant curvature in terms of differential forms. To this 
end we suppose M to be a Riemannian manifold and let œw, 1 < i < n, 
denote the field of coframes dual to an orthonormal frame field E,,..., E, on 
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an open set U c M, with œj, 1 < i,j < n, denoting the corresponding con- 
nection forms. We then state the following lemma, whose proof is contained 
in Theorem 4.9 and Exercise 1. 


(6.1) Lemma If M has constant curvature K, then the curvature forms 
Qj = doj — }, wf A œj are given by 


Qi = -Ko a a’. 


Conversely, if on a neighborhood U of each point of M there is an orthonormal 
frame field E,,...,E, for which the uniquely determined œ', œw} satisfy this 
equation, then M has constant curvature K. 


We shall use this presently to give an example of a manifold of constant 
negative curvature. Before doing so we recall that Euclidean space with its 
standard Riemannian metric is a space of zero curvature and that the n- 
sphere of radius a in R"*! with the induced Riemannian metric has constant 
curvature K = 1/a?. Thus for every nonnegative real number K, we have 
already found an example of Riemannian manifold of arbitrary dimension n 
with constant curvature K. We shall next give an example of an n- 
dimensional Riemannian manifold of constant curvature K = —1. A slight 
variation (Exercise 2) will produce an example for any K < 0. 


(6.2) Example (Hyperbolic space) Let M be the open upper half-space 
of R" defined by M = {xe R" | x" > 0} with the Riemannian metric given by 
the line element (see Section V.3) 


a (dx!) po (dx")? 
ds? = (er? i 


More precisely, we note that, as a manifold, M is covered by a single coor- 
dinate system with local coordinates xt, ..., xX" and coordinate frames 
6/éx', ..., 8/ðx". This is because, as a manifold, M corresponds to an open 
subset of R”. In these local coordinates, the components of the Riemannian 
metric ® are given by 


ô ĉĉ Ôi; 
gy(x) (ra) (x")? 
We use Lemma 6.1 to see that this manifold has constant curvature K = —1 


as claimed. (When n=2, this is the Riemannian manifold of 
Example VII.9.5.) 

Let E; = x"(@/éx'), i = 1,..., n; these define an orthonormal frame field 
on all of M. We denote by w!, ..., œ" the dual coframes which are given by 
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wi = (1/x")dx', i=1,...,n. It is easy to verify that the forms 
œj = 6,;0' — ô satisfy the equations 
n 
dœ = Yoinw, and œj+ oi =0; 

j=1 
hence they must be the connection forms since these are uniquely 
determined by these conditions. Finally, taking the exterior derivative of œl 
we obtain 


oii TOE FE 
Qi = dai — } of nw] = +w n ow. 


Then from Lemma 6.1 it follows that M has constant curvature K = —1. 
We call this hyperbolic space and denote it by H” (for its underlying space, 
the “half-plane ”). 


Thus we have examples of spaces of positive, zero, and negative constant 
curvature. Note that all three examples are simply connected: when K > 0, 
our example was the compact manifold S” and, when K = Oor K = —1, the 
corresponding manifolds E" and H” are diffeomorphic to R". For conve- 
nience, in what follows we suppose that S” has radius +1 and hence K = +1. 
Since S” is compact, it is complete; we also know E” to be a complete 
Riemannian manifold; and we shall prove later that H” is complete. The 
importance of these facts stem from the following theorem (which goes back 
to much earlier work of Killing and Hopf). 


(6.3) Theorem Every complete, simply connected Riemannian manifold M 
of constant curvature K = +1,0, or —1 is isometric to one of the three 
examples above, in fact to S" if K = +1,toE"ifK = 0, and to H" if K = —1. 
More precisely, given p€ M, and q in either S", E", or H” according to whether 
K = +1, 0, or —1, and given a prescribed linear map of T,(M) onto the 
tangent space at q which preserves the inner product, then there is exactly one 
isometry F of M to the corresponding space of constant curvature taking p to q 
and such that F, corresponds to the given linear mapping on T,(M). 


This is an immediate consequence of Theorem 5.14 once we know that 
H” is complete—which is proved later. We remark that using Exercise 2 and 
spheres S" of arbitrary radius, we may extend this theorem without difficulty 
to spaces of any constant curvature. Although Theorem 5.14 is not proved in 
this text, this consequence of it will be used. We have the following obvious 
corollary of Theorem 6.3. 


(6.4) Corollary Let M be S", E”, or H" and let E,,,.--, Enp E1qs+-+ Eng 
be orthonormal frames at two arbitrary points p, q of M. Then there is a unique 
isometry of M taking p to q and E,, to Eig, i = 1,...,n. 
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This shows that the group of isometries is transitive on M and makes it 
plausible that in each of these cases it is a Lie group. We already know this 
however for S", whose group of isometries is O(n + 1) and for E", whose 
group of isometries consists of rotations and translations and their products 
(Example III.7.6). The group of all isometries of H” will be studied only in 
the special case n = 2, although some indication will be given of the general 
situation. 

In order to carry our study of these spaces somewhat further we make 
some comments concerning covering spaces. If M is a Riemannian manifold 
and M a covering manifold with covering mapping F: M — M, then there is 
a unique Riemannian metric on M such that F is a local isometry. When M 
has this metric, the covering will be called a Riemannian covering. The fol- 
lowing facts are quite easily verified from the definitions: (i) F carries 
geodesics to geodesics and each geodesic on M is covered by a unique 
geodesic on M; (ii) If M is complete, then M is also complete (convergence 
of Cauchy sequences is a local phenomenon); and (iii) the covering transfor- 
mations are isometries of M. With the aid of these facts one may reduce the 
determination of manifolds of constant curvature to a group theoretic 
problem—or at least make the first step in that direction. 


(6.5) Theorem Let M be a complete manifold of constant curvature 
K = +1,0, or —1. Then the universal covering manifold M is isometric to S”, 
E", or H", respectively, and M is the orbit space of a subgroup T of the group of 
isometries of M which acts freely and properly discontinuously on M. 


The theorem follows from the fact that M is complete, simply connected, 
and (since the covering mapping is a local isometry) has the same constant 
curvature as M. We know from the theory of covering spaces that M = M/T 
and that the covering transformations I act freely and properly discontin- 
uously (as a group of isometries). We give some indication of how this may 
be used by considering some examples. 


Spaces of Positive Curvature 


In order to find Riemannian manifolds of constant positive curvature 
K = +1, it is necessary to find subgroups I of the group of isometries of S”, 
the unit sphere, which act freely and properly discontinuously on S”. The 
isometries of S" are contained in O(n + 1) which acts in the usual way on the 
unit sphere in R"**, hence I c O(n + 1). The assumption that T acts freely 
means that no element of I, except the identity, leaves a point of S” fixed. 
Thus if del and A Æ I,A cannot have +1 as a characteristic value. 
Moreover, I’ must be a group of finite order, since otherwise there must be 
an x € S" such that [x = {Ax | Ae T} has a limit point, which would contra- 
dict the proper discontinuity. Thus we must find finite subgroups of 
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O(n + 1) no element of which (except the identity) leaves a vector x fixed. 
This is clearly a necessary condition for I’, it is also sufficient (Exercise 3). 

The simplest example of a subgroup I of O(n + 1) of the type described 
is the group consisting of two elements, l = {+ I}. The orbit space S"/T is 
the collection of all antipodal pairs of points on S” and is, as we have 
mentioned earlier, just the projective space P"(R) (Example III.2.5 and 
Exercise III.2.3). Thus for every n we have at least two inequivalent spaces of 
constant curvature—real projective space and its universal (Riemannian) 
covering space S". When n is even, we have the following fact. 


(6.6) If n is even, then S" and P"(R) are the only complete manifolds of 
constant curvature K = +1. 


Proof This is seen as follows. Let I be a properly discontinuous group 
of isometries acting freely on S". Then I c O(n + 1) and each Aer is an 
(n + 1) x (n + 1) orthogonal matrix. Therefore A must have a real charac- 
teristic value since the degree of its characteristic polynomial is an odd 
number n + 1. Since the characteristic values of an orthogonal matrix are of 
absolute value one, A has +1 as a characteristic value. We have seen that 
only the identity on I can have +1 as a characteristic value, hence — 1 is a 
characteristic value of A. This implies that A? has + 1 as characteristic value, 
so A? = I. This means that each of the characteristic values of A is either + 1 


or —1, and hence, either all are + 1 and A = J, or all are — 1 and A= —I. 
This completes the proof when combined with the example mentioned 
above. i 


(6.7) Example When n is odd, other possibilities can occur. As an indica- 
tion we will show that in the case of S? there exist many examples of finite 
subgroups I" c O(4) which act freely on S? and thus give manifolds $3/T of 
constant positive curvature. The examples are based on the algebra K of 
quaternions, that is, on the real linear combinations 


q=x + yi+zj+ wk 


of the four symbols 1, i, j, k with the usual rules of multiplication and with 
componentwise addition (see Chevalley [1]). We denote by q, the conjugate 
of q, 
i q = x — yi — zj — wk 

and by ||q|| the usual norm |/q|| = (qq)'’”. Then K is in obvious one-to-one 
linear correspondence with R* and this corresponds to the standard norm in 
Rt. Hence K, = {q| ||q|| = 1}, the quaternions of norm one, correspond to 
S? c R*. As usual we identify K and R* as vector spaces and as manifolds 
and we identify K, and S? as manifolds. The important thing for us is that K, 
is a group with respect to quaternion multiplication, since ||¢,q.|| = 
lgilllg2l. Thus, if ge X,, then left translation L4: K— K defined by 
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L,(x) = qx is an R-linear mapping of K onto K and preserves the norm of x, 
that is, ||L,(x)| = |x|. This means that as a linear transformation of 
K = R4, L, is an orthogonal transformation. In brief, S? = K, is a group 
space and left translations are orthogonal transformations, in fact isome- 
tries, of S? with its usual Riemannian structure. Since no left translation 
except the identity can have a fixed point, we need only find examples of 
finite subgroups I of K,—each such example determines a three- 
dimensional manifold of constant positive curvature and they are all 
determined this way. 

To find finite subgroups of K, one uses the following fact (Exercise 4). 
There is a natural homorphism z: K, > SO(3) which is onto and has kernel 
+1 (+41 is the unit quaternion). This homomorphism is given as follows: 
Let R? be identified with the subspace of K of all quaternions of the form 
q = yi + zj + wk, with real part x = 0. Then to each q'€ K, we let corre- 
spond the rotation x(q’) of R? given by q > q’q(q’)" 1 Now given any finite 
subgroup T, c SO(3), then I = n` *(I,) is a finite subgroup of K,. Such 
subgroups of SO(3) are easy to find—the group of symmetries of any regular 
solid (omitting those of determinant — 1) give examples. The problem of 
classifying all complete spaces of constant positive curvature has recently 
been completed by Wolf [1]; classification of the finite subgroups of K, and 
of SO(3) is carried out as an example on p. 83ff of his book. 


Spaces of Zero Curvature 


Now consider the Riemannian manifolds which have Euclidean space of 
the same dimension as their universal Riemannian covering space; they are 
the (complete) spaces of zero curvature. Thus they are of the form 
M = E" /T, the orbit space of a subgroup T of the group of isometries (rigid 
motions) of £”. If we identify E" with R” and use vector space notation, then 
each isometry is of the form x—Ax+ 6, where AeO(n) and 
b = (b!,..., b"), and determine, respectively, a rotation and translation of 
the space (Examples III.7.6 and IV.9.4). Since, locally at least, the geometry 
of any such M is just that of Euclidean space, these spaces might seem to 
lack interest. This is not the case however; in particular, the global behavior 
of geodesics is very different from that of geodesics in E". We have already 
noted this in the case of two examples: the cylinder, which is just E’/I with 
I = {x +x + ne, |e, = (1, 0), ne Z}, and the torus T? obtained as the 
orbit space of the group of translations {x > x + ne, + me, |n, me Z, 
e, = (1,0), e, = (0, 1}. 

Historically the study of these spaces is closely linked to that of the study 
of crystal structures on the plane E? and in Euclidean space E?, that is, to 
uniform coverings of the plane by congruent polygons and of E 3 by congru- 
ent polyhedra. It is fairly easy to convince ourselves that the symmetries of 
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Figure VIII.9 


such crystalline structures—rigid motions carrying the structures onto 
themselves—form a subgroup I of the group of rigid motions which acts 
properly discontinuously (Fig. VIII.9). Elements of such groups may well 
have fixed points however, so these groups are somewhat more general than 
those which generate examples of manifolds of zero curvature. It was proved 
in the 19th century by several mathematicians independently (by 
classification of all possibilities) that there were only a finite number of 
crystal structures on E°. This gave rise to the question posed by Hilbert [2] 
in his famous address of 1900 as to whether the number of possible isomor- 
phism classes of properly discontinuous groups of motions I of £" for which 
the orbit space E"/T is compact is finite for every n. These are called crystal- 
lographic groups, and Hilbert’s question was answered affirmatively by 
Bieberbach [1] in 1911. This implies, in particular, that for every dimension n 
there exist at most a finite number of compact Riemannian manifolds of 
curvature zero. Among these, of course, is the torus T”, and it is a con- 
sequence of Bieberbach’s work that every such manifold has the torus as 
covering space. The proof of these theorems involves very interesting group 
theoretic arguments; it may be found in the books of Kobayashi and 
Nomizu [1] and Wolf [1]. In the latter book a complete classification of the 
manifolds of zero curvature in dimensions 2 and 3 is given; no general 
classification for all n is known. (See also Grove and Benson [1].) 


Spaces of Constant Negative Curvature 


First we consider H? as given in Example 6.2, except that we shall write 
(x, y) for (x!, x?) and identify H? with the upper half-plane of the complex 
numbers C by the correspondence (x, y) z = x + iy. Then H? is the open 
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subset of C, consisting of all complex numbers z with positive imaginary part 
Imz >0. We may then write the Riemannian metric, or line element 
ds? = Y? j=, gy dx' dx’, in the complex or real form 


i, j=l 
dzdz _ dx? + dy’ 
(mz? y * 


We have already considered this Riemannian manifold and its isometries 
(Example VII.9.5). The reason for passing to complex coordinates is that it 
makes it much simpler to define and work with the group of isometries. Of 
course, other representations of H? and its group of isometries are often 
used—some of which extend to H” for all n, but the technical difficulties 
would be greater for us at this stage (see Wolf [1, Section 2.4]). Recall that 
mappings on C of the form z> w = (az + b)/(cz + d), a,b,c, de C such 
that ad — bc + 0, are isometries of H? ; in analytic function theory they are 
called linear fractional transformations (see Ahlfors [1] for example). The 
following theorem restates Example VII.9.5 and adds a little to it. 


ds* = 


(6.8) Theorem The group G of linear fractional transformations such that 
a, b, c,d are real numbers and ad — bc = +1 is exactly the group of isome- 
tries of H? identified with the upper half-plane of C. The mapping 
F: SI(2, R) > G defined by letting the matrix (2°) correspond to the linear 
fractional transformation z> w = (az + b)/(cz + d) is a homomorphism of 
SI(2, R) onto G with kernel +1. 


Proof Except for the assertion that this group contains all of the isome- 
tries, these statements were all proved, or given as problems, in 
Example VII.9.5. To review briefly the arguments, which the reader should 
check in detail, we note that the last statement is verified by a straightfor- 
ward computation. Whereas to see that the first statement is correct, we note 
that if w is the image of ze H? by a transformation of G, then 


Im z 

——-——, >0 

Jez + df? j 

so that the upper half-plane maps onto itself. If we compute dw, we find that 
dz 

(cz + d? 


Im w = 


dw = 


from which it follows that 
dw dw dz dz 


(Im w)? (Im z)? 


so that ds? is preserved—a shorthand way of seeing that the components of 
gi; transform as they should for an isometry. [Of course, this mapping could 
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be given in terms of real and imaginary parts, that is, the functions, u(x, y) 
and v(x, y), such that w = u(x, y) + iv(x, y) could be computed and the 
mapping written without use of complex variables; but the computations 
become much more difficult.] In order to see that this group G contains all 
isometries, we recall first that it acts transitively on the upper half-plane and 
second that it is transitive on directions. Indeed, in the example cited it was 
shown that the orbit of i = ./—1 is all of H?, which implies transitivity and 
that the isotropy subgroup of i consists of elements of G corresponding to 
matrices in S/(2, R) of the form 


cos@ sin@ 
—sin@ cos} 


This subgroup of G is transitive on directions at i; in fact it acts as SO(2) on 
the tangent space to H? at i. These facts together with Corollary 6.4 prove 
the assertion. I 


We note that angles on H? in terms of the given Riemannian metric are 
the same as angles on R?, moreover—as is well known—linear fractional 
transformations are analytic mappings on the complex plane and as such are 
conformal, that is, they preserve angles between curves. We will also use 
from complex function theory the fact that linear fractional transformations 
carry circles and straight lines of C into circles and straight lines (see 
Ahlfors [1]). Thus any circle which is orthogonal to the real axis will be 
carried by any element of G into a circle orthogonal to the real axis or a 
vertical straight line. We have left it as an exercise to prove that vertical 
straight lines are geodesics of H?. Then it follows rather easily that any circle 
orthogonal to the real axis is also a geodesic. In fact a little simple Euclidean 
geometry (Fig. VIII.10) shows that through a given zọ € H? there is exactly 
one circle (or vertical line) tangent to each direction at Zzọ and orthogonal to 


20 


xy 


Figure VIII.10 
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the real axis. Since isometries take geodesics to geodesics, this gives every 
geodesic through zo. One important consequence is that every geodesic can 
be extended to infinite length so that H? is seen to be a complete metric 
space. It is sufficient to check this for just one geodesic, namely, x = 0, y = t, 
0 < t < œ. The length of this geodesic from t = ato t = b is f? dt/t so it is 
unbounded in both directions, that is as a > 0 or b —> œ, which shows it is 
indefinitely extendable. We also saw in Section VII.9 that H? is an example 
of a symmetric space, which means that it must be complete 
(Theorem VII.8.4). We have previously noted that H? is the space of non- 
Euclidean geometry and that it is easy to see from this description of 
geodesics that Euclid’s postulate of parallels is not satisfied (although all the 
other postulates of Euclid are!). This is illustrated in Fig. VII.15. This behav- 
ior of geodesics should be contrasted with that on S? and P?(R), spaces of 
constant positive curvature on which every pair of geodesics intersect—twice 
on S? and once P?(R). 

We turn now to consideration of H”, n > 2 as described in Example 6.2 
and use this information to verify that H” is complete. First we note that any 
translation of H” in a direction parallel to the plane x" = O is an isometry. 
The same holds for a rotation of the underlying R” which leaves x" fixed, in 
other words, a linear transformation of the variables x!,...,x"~' with 
orthogonal matrix is an isometry. Thus any 2-plane determined by a point 
xe H" and unit vector X, at x can be carried to the submanifold 
H? = {xe H" |x! =+ x"! = 0} by an isometry of H”. If we then verify 
that geodesics of H? are geodesics of H” (Exercise 10), it will follow from the 
above facts concerning H? and known properties of geodesics that every 
geodesic of H” can be extended to infinite length. This means that H” is 
complete; it also means that the geodesics of H” are exactly the semicircles 
whose center lies on the (n — 1)-plane x” = 0 and whose plane is perpendic- 
ular to it. 

The geometry of H? is extremely useful in analytic function theory and 
the subgroups I of G which operate properly discontinuously on H? are 
extensively studied in automorphic function theory (see Lehner [1] and 
Siegel [1]). In fact automorphic functions are precisely those complex analyt- 
ic functions on H? whose value is the same at each point of the orbit of some 
such T. Thus they define functions on H?/I, the space of orbits. This is 
analogous to doubly periodic functions on C which take the same value at 
each point of the orbit of a group T of the form I = {z > z + mw, + nw} 
for (independent) w,, w,¢C and thus define functions on C/I = T°. The 
best known automorphic function is the one associated with the subgroup of 
G for which a,b,c, d are integers in each linear fractional transformation, 
that is, the image of SI(2, Z} c SI(2, R) under the homomorphism of 
Theorem 6.8. This group is known as the elliptic modular group. It acts 
discontinuously on H? but some elements have fixed points; however, it 
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contains subgroups which act freely and thus determine a manifold HT of 
curvature K = —1. Using analytic function theory or the geometry of H?, it 
is possible to show that there exist subgroups I of G acting freely and 
properly discontinuously so that H?/T is a compact manifold; in fact every 
surface of genus g > 1 can be obtained in this manner and hence every such 
surface has a Riemannian metric for which the Gaussian curvature is con- 
stant and equal to — 1. For n > 2, it is much more difficult to find subgroups 
I of the group of isometries of H” such that H”/T is a compact manifold; in 
fact this is an area of active research at present and has many interesting 
unsolved problems. 


Exercises 


1. Prove Lemma 6.1 in detail, using the known facts about curvature 
forms and equations of structure. 

2. Let abe any positive real number and let M be the subspace of R” such 
that x">0. Then the Riemannian metric on M given by g;(x)= 
(a?/(x")?) ô; has constant curvature K = —1/a°. 

3. Show that if T is a finite subgroup of O(n + 1), which has the property 
that no AcI except I, the identity, has +1 as a characteristic value, 
then I acts freely and properly discontinuously on S”. 

4. Show that a discontinuous group of isometries of a Riemannian mani- 
fold is necessarily properly discontinuous. 

5. Show that the homomorphism z: K, > SO(3) of Example 6.7 is indeed 
a homomorphism onto SO(3) with kernel +1 as claimed. 

6. Let G be a connected Lie group and K a compact subgroup, and 
suppose that G acts on G/K by left translation (in the usual way). Show 
that a subgroup I of G acts properly discontinuously on G/K if and 
only if it is discrete. Show that if I has no elements of finite order, then 
it acts freely. Apply this to G, the group of motions of E" [with 
K = O(n)]. 

7. Does a rigid motion x > Ax + b, A e O(3) and b = (b', b’, b°), of the 
space Æ?’ identified with R? have a fixed point? 

8. Show that the subgroup T of rigid motions of R? generated by transla- 
tions x > me, + ne, + pe;, m,n,peZ and e,,e,,e;, the standard 
basis, together with the motion x > Ax + 3e;, where A(e,) = e, 
A(e,) = —e,, and A(e3) = —es3, acts freely and properly discontin- 
uously on R? and that R°/T is compact. 

9. Show that vertical lines x = constant on the space H? = donee 


10. Compute If, for H” [using the natural coordinates (xt x")] and 
show that a geodesic of the submanifold H aes ote H” | 
xi =- = x"? = Q} is a geodesic of H”. 
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11. Show that the group I of linear fractional transformations of the form 
w = (az + b)/(cz + d), a, b, c, d, integers such that ad — bc = 1, acts 
properly discontinuously on H? but that it does not act freely. 

12. Let K be a real number and let p = 1 + (K/4) ?-, (x'). Prove that if 
a Riemannian metric is given on a coordinate neighborhood U, ọ of an 
n-dimensional manifold M, g(U) = B2(0), (for some £ > 0) by 


1 
gil) = P” bi; > 


then on U this metric-has constant sectional curvature K. 


Notes 


The ideas touched upon in this chapter and the previous one span the entire history of 
differential geometry, from the early work of Gauss [1] and Riemann [1] through that of 
Cartan [1] right down to the present. Given the scope of the subject, its treatment here was 
necessarily both selective and brief. However, for readers who wish to go further into some of 
the topics we have touched upon in Chapters VII and VIII, there are many excellent books 
available, of which we shall mention several. For surface theory Stoker [1], O’Neill [1], do 
Carmo [1], and Millman and Parker [1] are very helpful. These books are geometric and 
intuitive in approach yet lead directly toward the current work in manifolds of arbitrary 
dimension, whereas many other books on “classical” differential geometry do not. For the reader 
who wishes to delve more deeply into the subject of spaces of constant curvature, the book by 
Wolf [1] is an excellent source, especially for the zero and positive curvature cases. It also 
contains a very complete bibliography. A good introduction to the sort of problems one will 
encounter in spaces of negative curvature may be found in such books as those by Lehner [1] 
and Siegel [1], which deal exhaustively with the two dimensional case and its relations to 
Riemann surfaces and automorphic function theory. For Riemannian Geometry in general, the 
encyclopedic two volume work of Kobayashi and Nomizu [1] contains a wealth of information 
and a very complete bibliography. For questions concerning symmetric spaces, the reader is 
referred to Helgason [1], which also has an extensive bibliography. These, together with Milnor 
[1], Klingenberg [1], and Cheeger and Ebin [1] will give some idea of the current thrust of the 
theory and of its richness and diversity. 

Most of the current interest in Riemannian geometry is in what are known as global 
problems, which in very many cases are concerned with the relation of (often purely local) 
properties of the curvature of the Riemannian metric of the manifold to its global geometric 
structure, for example, to its topology, Euler characteristic, and so on. As an epilogue to this 
chapter we shall mention several famous results along these lines which we did not have the 
time or space to take up although they are easily accessible to the reader at this point. There are 
a number of results which draw conclusions about the manifold from the assumption that it has 
a Riemannian metric whose sectional curvatures are all of the same sign—but not necessarily 
constant. For example, if they are all greater than a positive constant e, then the manifold is 
compact. Since its universal Riemannian covering manifold necessarily has the same property, 
it must also be compact, from which we can conclude that the fundamental group of the original 
manifold is finite. On the other hand, it has been shown that if the sectional curvatures of a 
Riemannian manifold are all negative, then the universal covering manifold must be diffeomor- 
phic to R", a fact which has strong implications for the deRham groups of the manifold. Many 
beautiful results of this type may be found in Milnor’s book [1], which is highly recommended 
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for further reading. It is not evident from the two examples cited, but it is a fact that the 
influence of the curvature on the structure of the geodesics is crucial to many such results. In 
addition to Milnor’s book, the reader will find many interesting results of this sort—and with 
less emphasis on topology—given by Bishop and Crittenden [1]. 

As a final example we mention the famous classical theorem of Gauss and Bonnet which 
gives the following relation between the Gaussian curvature K and Euler characteristic y of a 
compact orientable surface M: 


2ny = | KdA (dA = area element on M). 
"M 


This has many interesting consequences. For example, if M has a Riemannian metric such that 
K > 0 everywhere, then it is homeomorphic to $°, and if the metric is such that K < 0 every- 
where, then it must have genus g > 1, that is, it must be homeomorphic toa sphere with two or 
more handles attached. A proof of this theorem is given in the books on surface theory 
referred to above. The generalization of the theorem to higher dimensions is not easy and 
requires some use of algebraic topology. In fact, this theorem resisted generalization for many 
decades and its extension to higher dimensional Riemannian manifolds by Allendoerfer-Weil 
and by Chern [3, 4]—especially Chern’s method of proof—led to many new problems in differ- 
ential geometry and to the discovery of further important relations between the Riemannian 
geometry and the topology of manifolds. 
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A*, transpose conjugate of complex Atlas, 59 
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Ad g, adjoint homomorphism, 245 
Adjoint representation of Lie group, 246 BY(M), exact forms, 274 
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Algebra of differential forms on M, /\(M), 214 Basis 
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Alternating tensor, 202 dual, 177 
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Approximation theorems, 197, 289 of covariant tensors 7"(V), 200-201 
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Bilinear form, 183-187 

induced mapping of, 185 

skew symmetric, 184 

symmetric, positive definite, 184 
Binormal to space curve, 303 
Brouwer fixed point theorem, 282 
Bundle, see Tangent bundle 


Cc 


C*, multiplicative group of complex 
numbers, 82 
C', C” differentiability class, 21-22, 66 
C*-compatibility, 52 
C” function, 22, 66 
C” mapping, 66 
C”(a), C*(p), germs of C*-functions at 
a, p, etc., 32, 107 
C”, real analytic functions, 24 
C'(U), C“(U), C*(M) differentiable 
functions on U or M, 22, 107 
Ctx), C(x), open cube of R", 2 
c(A), Jordan content of a set, 230 
Canonical basis of R”, 2 
Center of a Lie algebra, 289, 388 
Chain rule, 23 
for mappings, 27 
Change of variables in integration, 233 
Characteristic function of a set, 231 
Chart, 59 
x(M), Euler characteristic, 14, 415 
Closed differential form, 274 
Coframes, 179 
Complete integrability, 160 
Complete vector field, 142 
Components 
of a bilinear form, 184 
of a covector, 179 
Connected sum of manifolds, 258 
Connection, 317-323 
connection forms, 328, 390 
restriction of, 319 
Riemannian, 318 
Constant curvature, manifold of, 386, 
406-413 
Constant vector field, 323 
Content zero 
in R”, 230 
on a manifold, 235 
Contractible space, 270 
Contracting mapping theorem, 43 


PDF pdfFactory Pro 


INDEX 


Coordinate coframes, 179 
Coordinate frames, 110 
Coordinate neighborhcods, 52-56 
open balls and cubes, 55 
Coordinates 
local, 10 
oriented, 215 
Coordinate function, 52 
Coset space, 94 
group action on, 165 
Covariant derivative, see Differentiation of 
vector fields 
Covariant tensor field, 201-205 
induced mapping of, 202 
Covector, tangent, 178-179 
field, 178 
Covering 
locally finite, 11, 193 
refinement of, 11, 193 
regular, by spherical (cubical) 
neighborhoods, 194 
Covering manifolds, 101, 289-296 
isomorphism of, 293 
Covering map, 101 
Covering transformation, 101 
Cube, on a manifold, 237 
Curvature 
of plane curve, 305 
Riemannian, 325-327 
forms, 389 
sectional curvature, 385, 393 
symmetries of, 383 
of space curve, 302 
of surface, 18, 374 
Curve 
differentiable (C’), 22 
Cutting and pasting of manifolds, 11-14, 258 


D 


V, connection on a manifold, 317 

VY, restriction of a connection, 319 

Vx Y, covariant derivative, 314-316 

@H", 2M, boundary of H", M, 253-254 

af, ..., fA , ..., x"), Jacobian 
matrix, 26 

a/éx', natural frames of R", 36 

D, interior of D, 231 

2a), AU), algebra of derivations, 33, 40. 
108 

DF, DF(x), Jacobian matrix, 27 
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d, dy, exterior derivative, 220 
dp/dt, tangent vector to curve p(t), 126 
(D/dt) (dp/dt) = 0, equation of geodesic, 
330-331 
DX/dt, DY/dt, covariant derivative, 309 
dZ{dt, derivative of vector field, 300 
d(p, q) metric on Riemannian manifold, 
189 
ds? , metric tensor, 188 
df, differential of function, 180 
dx', coordinate coframes, 182 
Deck transformation, 101 
Dependent functions, 50 
A, distribution, 159 
ôl, Kronecker delta, 179 
de Rham group, 274 
de Rham’s Theorem, 275 
Derivation(s) 
on C%(U), 39 
into R, 33 
Derivative 
exterior, 220-227 
properties of, 220 
of vector field, 314 
Diffeomorphism, 67 
on open sets of R", 41 
Differentiable functions 
in weak sense, 21 
on Euclidean space, 21-25 
on manifolds, 65-66 
Differentiable manifold, see Manifolds 
Differentiable mappings 
composition of, 28, 67 
on Euclidean space, 25-28 
on manifolds, 65-67 
Differentiable mapping, weak sense, 26, 67 
Differentiable submanifold, see 
Submanifold 
Differentiable structure, 53 
Differential equations, systems of, 131-137 
existence theorem, proof, 174 
Differential forms, 213 
closed, 226, 274 
exact, 226, 274 
exterior, 213 
Differential of function, 180 
Differential of mapping, 109 
Differentiation of vector fields 
along curves in R", 298 
covariant differentiation, 309, 314-316 
Lie derivative, 154 
on submanifolds of R*, 307-316 
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Directional derivative as tangent vector, 
32 
Discrete group, action of, 96-100 
Distribution, on a manifold, 159 
involutive, 159-224 
local basis of, 159 
Divergence theorem, 262 
Domain of integration 
in R", 230 
on a manifold, 235 
Double of manifold with boundary, 255 
Dual basis, 177 
Dual vector space, 177 
Dynamics of moving particle, 304 


E 


E, F, G coefficients of first fundamental 
form, 242, 373 
E,, Eia, Eip coordinate frames, 30, 110, 
118 
E", Euclidean space, 4 
e'*, one-parameter group of matrices, 148 
e*, exponential of a matrix, 147 
Effective action of a group, 91 
Einstein manifold, 387 
Equivalence relation, open, 60 
Equations of structure, 391 
Euclidean space, 4-6 
Euclidean vector space, 2 
Euler characteristic, 14, 415 
Euler’s formula for surfaces, 371 
Exact differential form, 274 
Exponential of matrix, 147 
Exponential mapping 
on Lie groups, 150 
on Riemannian manifolds, 337 
Exp X,, exponential mapping, 337 
Exterior algebra, 211 
of V, A(V), 211 
induced mapping of, 213 
Exterior differential form, 213 
Exterior differentiation, 219-223 


F 
F(t), tangent vector to curve, 126 
F,, F*, linear mappings induced by F, 


109, 121, 180, 184, 202 
F, (d/dt), tangent vector to curve, 112 
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f, f expressed in local coordinates, 65 
Ag, lift of a mapping, 289 
f*g, product of paths, 268-269 
F(k, n), k-frames in V” 63 
Fixed point 

of group action, 92 

of mapping, 282 
Flags, space of, 172 
Flat coordinate neighborhood, 160 
Flow, 127 
Foliation, 164 
Forms 

bilinear, 183-187 

connection, 328-329, 390 

curvature, 390 

exterior differential, 213-214 
Frames 

coordinate, 118 

field of, 38 

orthonormal, 329 

parallel, 299, 323 

space of, 93 
Free action of group, 94 
Frenet-Serret formulas, 303 
Frobenius’ theorem, 161, 233-236 
Fundamental forms of a surface, 370 
Fundamental group, 269-270 


G 


G/H, homogeneous space, coset space, 94, 
165 
G(k, n), Grassmann manifold of k-planes 
in R", 63, 168, 362 
Gin, R), general linear group, 56 
gis, coefficients of metric tensor, 188, 320 
T, discrete group, 97 
Tt, Tix, Christoffel symbols, 313, 
321-322 
Gauss-Bonnet theorem, 415 
Gaussian curvature, 18, 375 
Genus (of surface), 14 
Geodesics, 191, 312, 330-335 
as one-parameter subgroups, 355 
minimal, 346 
Geodesic sphere, 344 
Germs of C” functions 
in R", 36 
on manifold, 115 
G-invariance, 124 
Grassman algebra, see Exterior algebra 
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Grassman manifolds, 63, 168, 362-363 

Green’s theorem, 262 

Groups, see Action of group on manifold; 
discrete groups; Lie groups 


H 


H, mean curvature of surface, 374-375 
H?, half-plane as hyperbolic space, 167, 
361, 409-412 

H*(M), H*(M), de Rham groups, 274 
Ĥ*(M), invariant de Rham groups, 285 
H” 

half space of R”, 11, 252 

hyperbolic space, 404 
Homogeneous space, 165-172 
Homomorphism, see Lie group 
Homotopy, 266-267 

of mapping, 266 

of paths and loops, 268 

relative, 267 
Homotopy operator, 277 
Hopf-Rinow theorem, 347 
Hyperbolic space, 168, 404 


I 


£, homotopy operator on /\(I x A), 277 
la, inner automorphism, 245 
I(p), 134 
Is, I, 131 
ix, 226 
Ideal, of an exterior algebra, 225 
Imbedding, of manifolds, 69-74 
in R" 
compact case, 196 
general case, 197 
Immersion of manifolds, 69-74 
Index of vector field, 140 
Infinitesimal generator, see One-parameter 
group action 
Initial conditions for differential equations, 
131 
Inner product, seealso Riemannian manifold 
on vector space, 2, 184 
Integrable functions, 231, 236 
Integrable n-form, 236 
Integral curve, 126 
Integral manifold of distribution, 159 
maximal, 163 
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Integral of function on Riemannian 
manifold, 240 
Integral of n-form on manifold, 239 
on R", 240 
Integration on manifolds 
Lie groups, 244-250 
manifolds with boundary, 251-255 
Riemannian manifolds, 240 
surfaces, 241 
Int M, 253 
Invariance of domain (Brouwer), 10, 28 
Invariance of 1-parameter subgroup (under 
diffeomorphism), 142 
Invariant forms on Lie groups, 246, 285 
Invariant forms on M, (M), 285 
Invariant metric on Lie group, 247, 353 
Invariant vector field, 120 
G-invariant, 124 
Inverse function theorem, 41-46 
Irreducible representation, 250 
Isometric surfaces, 377-379 
Isometries, group of, 353 
of Euclidean space (rigid motions), 92 
167, 408 
of H?, 361-362, 409-413 
of S", 406 
Isometry of Euclidean vector spaces, 3 
Isometry of Riemannian manifolds, 191, 
327 
local isometry, 402 
Isotropic Riemannian manifold, 386 
Isotropy subgroup, 94 
Iterated integral theorem, 232 


J 


Jacobi identity of Lie algebras, 152 
Jacobian of a mapping, 25-26 
Jordan content, 230 


K 


K, Gaussian curvature of surface, 375 

K, quaternions, 407 

K, interior of K, 193 

K(7r), sectional curvature, 385 

kı, k2, principal curvatures to surface, 370 
k,, Characteristic function of A, 231 

k(s), curvature of space curve, 302 
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&(s), curvature of plane curve, 305 
k-frame of R", 63 
Klein bottle, 13 


L 


La, Ra, Lg, left and right translations on 
Lie group, 84, 121 
Lx on A(M), 226 
LxY, Lie derivative of vector field, 154 
l, m, n, coefficients of second fundamental 
form, 369 
Lattice, integral, 85, 99 
Leibniz rule, 33 
Length of curve, see Arc length 
Lie algebra, 151 
homomorphism of, 156 
group of automorphisms of, 245 
of Lie group, 156 
of subgroup, 156 
of vector fields on manifold, 151-157 
Lie derivative, 154, 226 
Lie group, 81-89 
compact 
bi-invariant metric, 247 
bi-invariant volume, 247 
integration on, 244-250 
de Rham groups of, 285 
homomorphism of, 85 
left invariant metric, 247 
representation of, 246 
subgroup of, 87-88, 142 
Lift of mapping, 289 
Line integral, 264 
independence of path, 271 
Linear fractional transformations, 410 
Linear mapping, dual of, 177 
Linear transformations, field of, 206 
Local one-parameter group action, 127 
Loops, product of, 268-269 


M 


A(R), n x n real matrices, 56, 59 

M AR), m x n real matrices, 27, 56, 59 

M, covering manifold of M, 101, 289-295 

m(A), Lebesgue measure of set, 230 

Manifold with boundary, 11, 253 
double of, 255 
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Manifolds 
abstract, 14 
differentiable, 52-59 
imbedding in R", 196-197 
orientable, 13, 215-219 
topological, 6-11 


two-dimensional, classification of, 14 


Mappings 
of Class C”, 26, 65 


differentiable (C”) on manifolds, 65 


on R", 25 
smooth, 26, 67 

Mean value theorem, 23 
for mappings, 26 


Measure zero, set of, in R", 230 


on a manifold, 235 
Metric, Riemannian, 186 
Minimal surface, 376 
Möbius band, 13 
Monkey saddle, 376 
Motions, rigid 


(isometries) on hyperbolic plane, 410 
(isometries) on R", 92, 167, 408 


N 


N, unit normal to surface, 366 
Natural basis of 7,(R"), 2, 30 


Natural isomorphism, tangent spaces to 


R", 29 


Neighborhood, see also Coordinate 


neighborhoods 


admissible neighborhood, 101 


Norm, of vector, 3 
Normal coordinates, 339 
Normal section of surface, 371 


Normal space to submanifold, 308 


Normal vector to curve, 303 
Normal vector to surface, 366 
O(n), orthogonal! group, 85 


oO 
Q, volume element, 215-219 


Qi;, curvature forms, 390 
w!', coframes, dual basis, 178 
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w!, wi, connection forms, 329, 390-392 
One-parameter group action 
basic theorem, 136 
examples of, 139-146 
global, 123, 128 
infinitesimal generator of, 123 
local, 127 
One parameter subgroups of Lie groups, 
147-150, 355 
Orbit of group action, 92 
of one-parameter group, 125, 128 
Orbit space of group action, 93 
Order of differentiation, interchange of, 
24, 325 
Ordinary differential equations, 131-138 
172-174 
Orientation of manifold, 216 
of vector space, 215 
Oriented basis, 215 


P 


P"(R), real projective space, 15, 61 
Paracompact space, 11, 193 
Parallel curvature tensor, 401 
Parallel displacement of vector field, 311, 
323 
Parallelizable manifold, 119 
Parametrization 
of manifolds, 68 
of submanifold, 312 
of surface, 113 
Partial derivatives, 21 
differentiability and, 21-22 
independence of order, 24 
Partition of unity, 193-198 
applications of, 195-198 
Path, 268 
@”, 204 
77,(M, b), fundamental group, 270 
Planar point, 371 
Poincaré Lemma, 278 
Positive curvature, spaces of, 406 
Principal curvatures, of surface, 370-373 
Principal directions, 372 
Projective space, real, 15, 61 
Proper function, 199 
Proper mapping, 81 
Properly discontinuous action of a group, 
96, 104 
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Quaternions, 407 
Quotient space, 60 
Quotient topology, 60 


R 


R, real numbers, 1 
R, real numbers as additive group, 123 
R*, multiplicative group of real numbers, 
63, 82 
R", n-tuples of real numbers, 1-3 
R(X, Y), R(X, Y) -Z, curvature operator, 
326 
R(X, Y, Z, W), curvature tensor, 327 
Rig, Rigi, Coefficients of curvature tensor, 
327, 384 
Rank of differentiable mapping, 47, 69, 111 
Rank of linear transformation, 47 
Rank of matrix, 46 
Rank theorem, 47-49 
Real analytic function, 24 
Regular covering, by spherical (cubical) 
neighborhoods, 194 
Regular domain, 254 
Representation, of Lie group, 249-250 
orthogonal, 249 
semisimple, 250 
Restriction of a differential form, 260 
of covector, 182 
Riemann integral, properties of, 231 
Riemannian geometry, fundamental 
theorem of, 318 
Riemannian manifold, 186-192 
as metric space, 189 
differentiation on, 317 
volume element of, 219 
Riemannian metric, existence of, 195 
Rigid motions, group of, 89, 92, see 
also Isometries 


S 


©, symmetric group, 166, 203, 215 

S, symmetrizing mapping, 204 

S, S?, S”, circle, 2-sphere, n-sphere, 7, 57, 
80 

SIQ, R), acting on H?, 361 

Si(n, R), special linear group, 84 
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SO(n), special orthogonal group, 353 
S(X,), shape operator, 367-368 
Section 
of tangent bundle, 341 
on coset space, 166 
Sectional curvature, 385-386 
geometric interpretation, 393 
Semisimple Lie group, 388 
sgn o, sign of a permutation, 203 
Shape operator, 367-368 
o-compact space, 193 
Op, involutive isometry, 351 
Simple connectedness, 268 
Slice 
of coordinate neighborhood, 74, 160 
of cubical neighborhood, 74 
Smooth mapping, 26 
Smooth structure, 54 
Sphere as manifold, 57, 80 
Starlike set, 23 
Stokes’s theorem, 257 
Subgroup 
discrete, 98 
isotropy, 94, 166 
Subgroup of Lie group, 87-89 
closed, 88 
one-parameter, 143 
Submanifold, 73-78 
imbedded, 73 
immersed, 70 
open, 56 
regular, 77 
Submanifold property, 75 
Submersion, 70, 74, 122 
Support of function, 194 
Surface in Euclidean space, 14 
geometry of, 366-374 
Symmetric Riemannian manifold, 351-357 
examples of, 357-363 
Symmetries of curvature tensor, 382-384 
Symmetrizing mapping, 204 
System of differential equations, 131-139 
autonomous, 131 
existence of solutions, 131, 174 
general case, 137 
with parameters, 137 


T 


TLV), F(V), FM), 200, 202, 208 
T?, T", torus, 7, 57, 80, 82 
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T(M), T(S?), T(R"), 16, 116-117, 335-336 
T,(M), T,(R"), tangent space at point, 8, 
29, 32, 107-108 
T, N, B, tangent, normal, binormal, 303 
Tangent bundle, 16, 116-117, 335-336 
sphere bundle, 18 
Tangent covectors, 177-183 
Tangent space 
as derivations into R, 35 
at point of M, 107-108 
at point of R", 29, 32 
to surface, 114 
Tangent vector 
to curve, 112 
to manifold, 107-116 
to R*, 29-36 
Tc, translation along geodesic, 354 
t(s), torsion of space curve, 303 
Tensors, 199-205 
alternating mapping, 204 
alternating (skew symmetric), 202 
components of, 200 
exterior product of alternating, 209-214 
linear space of, 200 
multiplication of, 208-209 
symmetric, 202 
symmetrizing mapping, 204 
Tensor algebra, 208 
Tensor field, 201-202 
covariant derivative of, 396-401 
invariant, 246 
parallel, 400 
Theorema Egregium (Gauss), 18, 377 
Tietze—-Urysohn extension theorem, 289 
0! , 64 connection forms, 328 
O(t, p), 9.(p), 8,(t), action of R on M, 123 
Topological manifold, 6-10 
Toral group, 82 
Torsion, of space curve, 303 
Torus, 7, 57, 80, 92 
Transitive action of group, 92-93, 165 
Triangulable manifold, 243 


U 


U, p, coordinate neighborhood, 10, 52 
Umbilical point of surface, 371 
Universal covering space, 296 
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V*, dual space, 177 
y", vector space of n-tuples, 2 
Vector, 1-2 
norm of, 3 
tangent to manifold, 107 
Vector fields, 116-122 
along submanifold, 307 
complete, 142 
constant, 311 
F-related, 120, 121 
invariant, 120 
left invariant on Lie group, 142, 156 
Lie algebra of, 151 
on submanifold, 308 
parallel, 311 
restriction to submanifold, 118 
singular points of, 140 
on subsets of R", 37-39 
Velocity vector of moving particle, 112, 304 
Vol D, volume of D, domain of 
integration, 230 
Volume element, 219 
Volume, Riemannian, 240 


Ww 


Wedge product, 209-214 
Weierstrass approximation theorem, 197 
Whitney imbedding theorem, 197 


X 


X(U), ¥(M), C” vector fields on U or M, 
40, 122, 151 

X #f, directional derivative, 32 

LX, Y], (Lie) bracket of vector fields, 151 

X, Y, Z, vector fields, 116-122 


Z 


Z*(M), closed forms, 274 
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